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Abstract 



The theory of decoherence aims to explain how macroscopic quantum objects 
become effectively classical. Understanding this process could help in the search 
for the quantum theory underlying gravity, and suggest new schemes for preserving 
the coherence of technological quantum devices. 

The process of decoherence is best understood in terms of information flow 
within a quantum system, and between the system and its environment. We develop 
a novel way of characterizing this information, and give a sufficient condition for 
its classicality. These results generalize previous models of decoherence, clarify the 
process by which a phase-space based on non-commutative quantum variables can 
emerge, and provide a possible explanation for the universality of the phenomenon 
of decoherence. In addition, the tools developed in this approach generalize the 
theory of quantum error correction to infinite-dimensional Hilbert spaces. 

We characterize the nature of the information preserved by a quantum chan- 
nel by the observables which exist in its image (in the Heisenberg picture). The 
sharp observables preserved by a channel form an operator algebra which can be 
characterized in terms of the channel's elements. The effect of the channel on these 
observables can be reversed by another physical transformation. These results gen- 
eralize the theory of quantum error correction to codes characterized by arbitrary 
von Neumann algebras, which can represent hybrid quantum-classical information, 
continuous variable systems, or certain quantum field theories. 

The preserved unsharp observables (positive operator- valued measures) allow for 
a finer characterization of the information preserved by a channel. We show that the 
only type of information which can be duplicated arbitrarily many times consists 
of coarse-grainings of a single POVM. Based on these results, we propose a model 
of decoherence which can account for the emergence of a realistic classical phase- 
space. This model supports the view that the quantum-classical correspondence 
is given by a quantum-to-classical channel, which is another way of representing a 
POVM. 
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Chapter 1 
Introduction 



At present, the best of our understanding of the fundamental laws of nature is 
summarized by the standard model of particle physics, together with the general 
theory of relativity. This understanding is not perfect and fails to account for 
certain cosmological and astronomical observations. In addition, there are purely 
logical reasons to want to modify general relativity or the standard model. For 
instance, we do not have a reasonable idea of how the gravitational field, which 
is modeled classically, should interact with quantum fields. Since all of matter is 
modeled at the fundamental level by quantum fields, this represents a large hole in 
our understanding of nature. Experiments are of little help because gravitational 
effects are extremely weak in the regimes where the quantum nature of fields can 
be probed. 

It is usually assumed on logical grounds that the first necessary step toward 
a consistent picture of all fundamental interactions would be to replace general 
relativity by a quantum theory of gravity. The problem of quantum gravity can be 
summarized as the search for a quantum theory which has general relativity as a 
classical limit. 

There do exist several empirical methods to find a "quantum version" of a clas- 
sical theory — a process called quantization — but these methods do not guarantee 
that the quantum theory obtained will have the classical theory we started from 
as a classical limit. Instead, quantization methods are most often used as a tool 
to describe a quantum theory using some form of classical intuition, even if the 
classical model involved does not relate to any macroscopic phenomenon. 

In fact, there is no reason to expect that there exists a well-defined mathematical 
procedure which would directly yield a quantum theory starting from its classical 
limit. It is the opposite process — namely the derivation of the effective classical 
limit of a given quantum theory in a given context — which ought to be completely 
understood. 

Given that, surprisingly, very little is known about the subject, we may hy- 
pothesize that the problem of deriving the classical limit of a quantum theory has 
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never been considered of prime importance in the past. Indeed, there is no real 
need to be certain that a given quantum theory has the right hmit when plenty 
of experiments are directly testing the candidate quantum theory itself. This may 
explain the usual acceptance of the traditional quantization prescriptions: it does 
not matter how the quantum theory was arrived at, as long as experiments say it 
is correct. 

However, quantization must be taken more seriously in the context of quantum 
gravity, since the classical limit is the only prediction that we can use to refute any 
particular model. 

In this thesis we will study the problem of understanding how a quantum system 
becomes, or appears, classical. We will do so by generalizing certain results of 
quantum information processing. Some of these results were published in References 
[U 121 E]- Our results on decoherence also appeared in Reference [1]. 



1.1 Information, channels and classicality 

Before we explain in more details the problems that we want to solve, let us clarify 
our use of certain terms which can intuitively have various meanings. 



In Section 2A we will define precisely what we mean by quantum and classical 
physical models. This terminology is uncontroversial because we only use the ad- 
jectives quantum and classical to qualify a model and not a physical system itself. 
For instance the same system could be described with a quantum as well as a clas- 
sical model. Later in the text we will be less pedantic and simply say "classical 
system" as a shorthand for "system modeled classically". If we say that physical 
systems are fundamentally quantum, we mean that at the microscopic level they 
are best described by a quantum model. Our goal will be to understand how the 
classical systems that we are familiar with are related to the underlying quantum 
description of the same physics. Of course, one could find a contrived classical 
model which is as complete as the quantum description (say Bohm's theory), how- 
ever, even disregarding the multitude of intrinsic problems associated with such a 
theory, one would still have to explain how to derive observed classical phenomena 
as a limiting case. This is not the path that we are going to follow. We will in 
fact obtain reasonable conditions under which an effective classical model emerges 
which is related to the underlying quantum model in a non-contextual way (see 



Section 5.3) 



A communication channel, or channel for short, will be understood as the most 
general way that the state of a system B can be conditioned on that of a system 
A. If A and B are both quantum or both classical, this corresponds to the usual 



notion of a quantum channel (Section 2.1.4) or classical channel (a stochastic map) 
respectively. 

We will use the word "information" essentially as a synonym of "correlation". 
Hence two different systems will always be involved in a sentence containing this 
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word. For instance we could say that a system B contains information about a 
system A. If we possess a joint state for A and B, this would mean that some 
observable of A is correlated with some observable of B. More importantly, in the 
context where the state of i? is a function of the state of A, i.e. if they are related 
by a channel, we will say that B contains information about A if, by performing 
a measurement on B, we learn something also about A. More precisely, we will 
characterize the information that B possesses about A by checking exactly which 
property (which observable) we indirectly measure on A when performing a given 
measurement on B. This is the concept that we will analyze in Chapter |3j 

This qualitative information can be quantified in certain contexts using gener- 
alizations of Shannon's theory (as for instance in Section 2.1.3). However, we will 
be more interested in certain discrete aspects of the structure of this information, 
especially whether or not it is classical (Section 5.3). 



1.2 The emergence of classicality 



It is widely believed that everything in nature is fundamentally quantum. Hence 
we need to explain why, and how, most of it actually appears classical in a wide 
range of situations. At first glance, the answer may seem obvious: things appear 
classical when we do not measure their behaviour accurately enough (compared to 
h) to detect their quantumness. The main problem with this answer is that it does 
not tell us specifically which observable of the quantum system is being measured 
when such an "inaccurate" observation is performed. Indeed, we know that there 
are many fundamentally incompatible ways of observing a quantum system. Each 
way to do it is labeled by an observable. By contrast, there is a single best way to 
observe a classical system: it suffices to measure its precise state. Everything else 
can be deduced from it. We will therefore focus on the following question: 

Question 1.1. Which quantum observable is being measured in the complete ob- 
servation of a classical system? 



This way of formulating the question of the classical limit is not standard, and, 
although it is rather natural, it requires some detailed explanations, which are the 



subject of Section 5.1 



The first thing to note, which might be a source of confusion for the reader, is 
that there is no need to assume that an observable only takes a single real number 
as its value. Although this assumption is rather innocent for a classical system, 
where a non-scalar observable can usually be seen a the joint observation of several 
scalar observables, it would be a strong conceptual limitation in the context of 
quantum mechanics where, as we know, observables cannot in general be measured 
jointly. This point is to be made in order to make it clear that measuring the state 
of a classical system amounts to observing one very special observable, namely the 
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observable which gives full information about the classical system. Indeed the state- 
space of a classical system (i.e. its phase-space) is rarely one-dimensional. Since 
any observation of a quantum system is specified by an observable, and since every 
classical system is a quantum system in disguise, this phase-space observation must 
actually correspond to the observation of a single quantum observable. Hence, our 
question: which quantum observable defines this classical system? 

In order to provide a realistic answer to this question we will need to consider the 
most general notion of an observable, which is provided by the formalism of Positive- 
Operator Valued Measures (POVM). For instance, we know that, in general, the 
various variables which parameterize a classical phase-space correspond to non- 
commuting quantum observables. This implies that the quantum description of 
phase-space must require some approximate joint measurement of non-commuting 
observables, something which is readily characterized by a POVM. Hence we will 
make extensive use of this notion, although we will specialize it slightly for technical 
and conceptual reasons (see Chapter |2]). 

Let us remark that there is another important related question, which we will 
not address here. A system can behave classically, not with respect to any obser- 
vation, but indirectly through the way that it interacts with an auxiliary quantum 
system. We are familiar with the description of a quantum system, say an electron, 
interacting with a system which is assumed classical, for instance the scalar poten- 
tial of the electromagnetic field. We know that the electromagnetic field is in fact 
quantum, therefore we must understand why it can be treated classically when we 
are concerned with the behaviour of that electron. This example involves no act 
of measurement, and is pertinent to the state of our knowledge of the interaction 
between quantum fields and gravity, where gravity plays the role of the classical 
electromagnetic field. 



1.3 Decoherence 

Any definition of a quantum measurement refers, explicitly or implicitly, to an in- 
teraction between the quantum system and a postulated external classical system 
(the observer). Note that the standard framework of quantum mechanics will be 
recovered explicitly in this way in Chapter [2j However, a quantum system interact- 
ing with other quantum systems can also undergo at least part of the phenomenon 
of collapse associated with a traditional axiomatic measurement. This is the ex- 
perimentally ubiquitous process of decoherence characterized, in its simpler form, 
by an evolution of the form 

p ^ 5^ k)(«ipK)(^i- (1-1) 



for some basis indexed by the integer i. 

Based on this phenomenon, the theory of decoherence [SI El El El El [IDl E] pro- 
vides an answer to Question 1.1 The premise is that nothing really important 
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happens during the act of observation itself. At the classical level, an observation 
only involves gathering information which is already present in the environment. 
The idea is that a system which appears classical is never isolated from the rest 
of the world — the relevant part of which we call the environment. Uncontrolled 
interactions constantly spread information about the state of the system into the 
environment. It is this process itself which selects which observable of the system 
the environment contains information about. The conscious act of observation then 
simply amounts to the observer becoming aware of this information. For instance, 
any macroscopic object constantly bounces into, or emits, photons. Simply gath- 
ering a few of these photons certainly has no consequence on the system, but may 
yield information about the position of the object at the time of the interaction 
with the photons. 

By contrast, what makes an act of observation quantum is the observer's ability 
to choose the nature of the interaction between his measurement apparatus and 
the quantum system, so as to decide which observable will be measured. Hence, for 
a system to appear classical, two conditions must be fulfilled: It must be continu- 
ously "monitored" by its environment, and the observer must learn about it only 
indirectly through the information which is already contained in the environment. 

Although the theory of decoherence has been used to tackle the infamous mea- 
surement problem related to the interpretation of quantum mechanics, we are here 
interested only in its ability to answer Question i.e. to tell us which observable 
of a quantum system defines its classical limit in a given context. 

Let us briefly summarize this theory here. In elementary presentations of de- 
coherence one considers an open quantum evolution which essentially removes 
the off-diagonal elements of the density matrix p representing a quantum state, as 



expressed in Equation The basis involved in this expression is selected by the 
dynamical process itself. Indeed, the initial states of the form p = are the only 
states which stay pure throughout the interaction. They are called pointer states. 
In addition, if this process results from a unitary interaction with an auxiliary 
quantum system, the joint final state of the two systems must be of the form 



um^\o)) = j2m\i)®\<p,) 



where {ip) is the initial state of the system, |0) the initial state of the auxiliary 
system (the environment), some orthonormal basis of the auxiliary system, 
and U the unitary operator specifying the joint evolution for some interval of time. 
Such a process has been initially studied by von Neumann [12j as an example of 
how a measurement apparatus can be modeled by a quantum system. Here the 
measurement apparatus is the auxiliary system, or environment. This process rep- 
resents a measurement of the system by the environment in the sense that a further 
measurement (now in the axiomatic sense) of the environment in the basis ef- 
fectively simulates a measurement of an observable of the system with eigenstates 
\i). Indeed, this measurement yields the outcome labelled by i with probability 
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The state collapses to \i) ® which reduces to \i) when we neglect the 
environment. The observable being measured is fixed only up to its eigenvalues, 
because we are only interested in the probabilities that it defines. In general we 
will simply consider a fiducial observable A = ^jAj|z)(i| for some distinct set of 
eigenvalues Aj which only act as labels for the eigenvectors. We will say that the 
observable A is the pointer observable. 

This is an example of a purely unitary (and hence purely quantum) process 
which selects a particular observable of the system, as specified by the orthogonal 
basis and transfers the information about this observable to the environment. 

Note that the evolution of the system alone is such that any quantum super- 
position between the states \i) is replaced by a simple statistical mixture. Indeed 
if the initial state is p = where = Yli'^il^)^ then the final state of the 

system is 

J2\z){t\p\t){z\=J2H'\^){^\- (1-2) 

i i 

Although this model introduces some essential ideas, it is too simplistic to be 
applied to realistic systems. Its main problem is that it can only explain the 
selection of sharp observables, i.e. traditional observables, and not POVMs, which. 



as we will argue in more detail in Section 5.1 , are needed for an understanding of the 



classical limit of the most common physical systems. In addition, this model does 
not explain why classicality is universal at macroscopic scales, since processes of the 
above kind are rather special. Finally, in this model, the effective classical system 
represented by the pointer observable does not have any non-trivial dynamics. 

These points have all been addressed before (see for instance |H1 E] , or the review 
section in [13j). Most of these approaches consider a single non-relativistic particle 
coupled via its position operator to a thermal bath of harmonic oscillators. One 
often assumes a Markovian approximation, which allows for the dynamics of the 
system to be represented by a differential equation in time. This is important 
as it removes the need for an explicit solution of the dynamics. In this picture, 
decoherence is always expected to take place in the position basis, due to the local 
nature of the interaction. Although such analyzes have shed important light on 
many aspects of decoherence, they have important limitations. For instance, they 
provide unsatisfactory, or limited answers to the following general questions. If the 
decoherence takes place in the position basis, what mechanism guarantees the — 
seemingly contradictory — preservation of momentum information? What singles 
out momentum as the other phase-space variable? How would we recognize an 
interaction which yields the emergence of a non-canonical phase-space structure? 
Does this process justify one quantization process over another? How is this loss 
of information in the system related to a gain of information in the environement, 
i.e. a measurement? Can some quantum information be preserved in the system 
despite the process of decoherence? 

Instead of attempting to improve on these models, we will adopt a new approach 
which generalizes in a different way the simple model that we first described. We 



Decoherence 



7 



will make no specific physical assumption. Instead we will show that the process of 
decoherence can be understood at a very fundamental level, in terms of information 
flow between the system and its environment. 

We can easily generalize von Neumann's analysis of the process of measurement 
and note that any interaction between a quantum system and another quantum 
system — the environment — yields correlations which are such that a subsequent 
measurement of a particular observable of the environment exactly simulates a 
measurement of some observable of the system. The observable of the system 
which is indirectly measured is selected by the nature of the interaction between 
the two systems. Mathematically, if the initial state of the system is p and that 
of the environment |0), and the interaction is specified by the unitary operator U, 
then the final states of the two system is U{p® |0)(0|)f/^. A measurement of a fixed 
observable with eigenstates \i) on the environment after the interaction yields the 
probabilities 

= Tr((l® |z)(2|)f/(p®|0)(0|)f/t) 

= Tr(f/t(l® |z)(z|)[/(p® |0)(0|)) 

= Tr((l ® (0|) (1 ® U{1® |0))p) 

= Tr(A,p). 

where we have only used the cyclicity of the trace, and defined the operators 

Ai := (1 ® (0|) (1 ® U{1® |0)) 

These operators specify a generalized observable on the system, namely a discrete 



POVM. Remember that we are trying to answer Question The point of this 
simple calculation is that the observable represented by the operators Ai, which is 
effectively measured on the system, is fixed by the choice of the unitary operator 
U, and hence by the physical interaction between the two systems. 

The problem is that the POVM which is selected in this process depends also 
on the observable being measured on the environment. In the special case where 
the evolution of the system alone is given by Equation there is a unique best 
observable which can be measured via the environment. "Best" in the sense that 
any other observation will simply yield less information about the same observable 



of the system (a concept which will be made precise in Section 5.3). However, this 
is not true of the more general case considered here, as can be seen by changing 
the basis \i) of the environment in the above example. Therefore, the observable 
being measured in this process still depends on the good will of an observer. The 
question is only shifted to another system: what compelled the observer to choose 
this particular observable of the environment? 

In fact, we will show in Chapter [3] that there is only one sharp observable 
(possibly trivial) which can be measured nondestructively in any such process. This 
is the reason why a particular observable is indeed selected in the simpler example 



defined by Equation 1.1 However, these requirements do not uniquely select a 
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general POVM. We will see that for a POVM to be selected uniquely we need to 
introduce another requirement, namely that the information about the observable 
be represented redundantly in the environment. 

Redundancy was previously introduced by OUivier et al. [H] in a slightly dif- 
ferent context. The authors attempted to identify the preferred observable, not via 
the full dynamical process as above, but only by the form of the final state of the 
joint system and environment after the interaction. The idea is that the preferred 
observable is that which is correlated with some observable of the environment. 
They indeed showed that a sharp observable of the system is uniquely selected if, 
in addition, these correlations are required to be redundant across many subsystems 
of the environment. Our approach, although it has a different premise, will indeed 
yield such correlations, hence accounting for their analysis. 

Our approach is based on a new method for characterizing the information 
preserved in a quantum communication channel (see Chapter |3|. The interaction 
between the system and its environment, for instance, involves two main channels. 
The first sends the initial state of the system to its final state. Equation 1A_ is an 
example of such a channel. The concept of pointer state and its generalization [15] 
is one way to characterize the information that this channel preserves. The second 
important channel is the one which says how the final state of the environment 
depends on the initial state of the system. This channel has rarely been considered 
explicitly. However, it is important because it tells us precisely what information 
the environment learns about the system, and therefore what is being measured in 
the process. 



1.4 Fighting decoherence 

We motivated the study of decoherence by the need to understand quantization. 
But understanding decoherence is also important for a completely different purpose: 
that of fighting it. 

Most introductory textbooks give the impression that quantum mechanics im- 
poses fundamental barriers on what it is possible to do with a physical system. 
However, the restrictions introduced with quantum mechanics are restrictions on 
what we can measure, or predict, but they are not restrictions on what a physical 
system can do. In fact, these additional constraints on what we can observe al- 
low for a greater freedom in the system's behaviour compared to classical physics. 
Indeed, quantum systems can behave in ways that were previously thought impos- 
sible [16]. For instance a computer based on the principles of quantum mechanics 
could solve problems which are classically intractable [I71|18]. The exact reason is 
not clear, although it can be understood intuitively from the fact that a quantum 
dynamical process seems to explore an infinite number of paths in parallel, in the 
sense of the path-integral formalism. In fact, a naive attempt at simulating the 
discrete evolution of a small finite quantum system rapidly yields an exponential 
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blow-up of the memory required to store the classical representation of the quantum 
state, as well as an exponential slow-down of the simulation. 

If we want to exploit the properties of quantum systems for some practical 
purpose, we have to make sure that we can maintain their quantumness, and prevent 
them from undergoing decoherence. This is far from being an easy task given that 
decoherence is present in every quantum experimental setups, and is typically much 
faster than thermalization [8]. 

One way to fight decoherence, or any other form of noise, is to simply avoid it. If 
some form of noise cannot be eliminated by active control of the environment, one 
may attempt to encode information in a way which will be resistant to this residual 
noise. For instance, one may be able to find a subspace which is not affected by the 
noise (i.e. a decoherence-free subspace) [191 EOl EH [22]. It is possible to refine this 
idea and find so-called noiseless subsystems [231 EH ES] . A more elaborate strategy 
is to find some subspaces, or subsystems, which are affected by the noise, but in a 
way which can be reversed. This yields to the method of quantum error correction 
[2S1 EH ESI ESI ED] , or to its refinement which allows for subsystem codes (operator 
quantum error correction) [SU [32] . 

Clearly these techniques characterize the information which is preserved by the 
quantum channel representing the noise. For instance, a subspace represents a 
property of the quantum system (a yes-no observable). If the effect of the noise 
can be reversed on this subspace, it means in particular that the information rep- 
resented by this observable has not been destroyed by the noise. This is precisely 
the kind of tool that we are looking for in order to understand the process of deco- 
herence. In Chapter |3| we will introduce a concept of preserved information which 
is more general, and better suited for our needs. In Chapter |4] we will show how 
our approach yields, special generalization of the theory of quantum 

error correction for subsystem codes [U E], which itself generalizes the other the- 
ories mentioned above. Importantly, our approach is suitable for a study of error 
correction on infinite-dimensional Hilbert spaces. In this context, we show that 
error correcting codes can be represented by any von Neumann algebras. Whereas 
a general von Neumann algebra represents hybrid quantum-classical information, 
type II and type III factors represent new exotic types of quantum codes which 
exist only on infinite-dimensional Hilbert spaces. 



1.5 Summary 

Let us summarize the problem at hand, and our reasoning. We want to understand 
by what process a quantum system becomes effectively classical, with a focus on 



the kinematical aspect of the problem. We have pointed out in Section L2, and will 
argue in more detail in Section |5.1[ that a classical limit, at the kinematical level 
(i.e. everything except the dynamics), must be specified by a generalized observable 
of the quantum theory. Therefore we have identified the type of object that we are 
looking for: a POVM. 
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We also have a good idea of the type of mechanism which could be at work to 
select this POVM. The theory of decoherence, which is firmly grounded in theory 
and experiments, tells us that this must essentially be a measurement of the system 
by its environment. An observable is indeed what characterizes the information 
gathered in a measurement. 

However, we need to understand what constitutes a measurement of a quantum 
system by another quantum system. We expect to be able to do so by studying the 
nature of the information transmitted from one system to the other. 



Chapter 2 
Quantum theory 



The role of this chapter is to introduce a number of basic concepts, as well as a 
certain point of view on quantum mechanics which will be referred to in the rest of 
this thesis. Our presentation is standard in that it follows, at least at the formal 
level, the presentation given in Kraus' lecture "States, Effects and Operations" [23] , 
which originates from [M] . The main difference stems in the fact that we are using 
a Bayesian interpretation of probabilities rather than a frequentist one. 

The expressions "quantum mechanics" and "quantum theory" will be used inter- 
changeably, just like the adjectives "quantum mechanical" and "quantum". Some- 
times, by "quantum mechanics" one refers only to Schrodinger's model of a non- 
relativistic quantum particle, so as to distinguish it from relativistic quantum field 
theory. Here we refer to quantum mechanics in a much more general sense, as in the 
"axioms of quantum mechanics" . For example, quantum field theory is an instance 
of a quantum mechanical theory, or at least tend be so (it is not fully formalized). 
Quantum theory is concerned with the common aspects of all quantum systems. 

In order to present quantum theory, we will take the view that the primary 
elements of a physical theory are yes-no questions, i.e. questions with only two 
possible answers, also called propositions in the field of logic, or effects in quantum 
foundations. We will mostly use the term "effect" once we got used to them. 
However, in the beginning we will call them "propositions" in order to remind the 
reader of their interpretation. Hence, we will attach a set of propositions E to any 
physical system of interest. 

Since we are going to have to deal with quantum mechanics in which uncer- 
tainties are unavoidable, even in principle, we will not assume that all questions 
can always be given an exact answer. Instead we assume that a typical answer will 
be a degree of confidence between zero and one in the truth of the corresponding 
proposition, i.e. a probability. Hence a certain state of knowledge about the system 
is an assignment of a probability to each proposition: 

p:E^ [0,1]. 
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Since all predictions of the theory are probabilities of the form p{a), we will assume 
that two propositions a and (3 are identical if p{a) = p{(3) for all states p. 



2.1 Finite-dimensional quantum theory 

This section is dedicated to an exposition of finite-dimensional quantum mechanics. 
This will allow us to introduce most concepts which we will refer to later, while 
avoiding the technical issues related to the full infinite-dimensional quantum theory, 



which will be covered in Section 2.3 This presentation is not meant to be an 
introduction to the subject, but rather a way to present a number of concepts 
which will be used later. Hence we assume that the reader is already familiar with 
the basic formalism of quantum theory, including the notion of density matrix. 

For a quantum system associated with a finite dimensional Hilbert space 7i, the 
propositions, which are also called effects, will be assumed to be all the self-adjoint 
operators with eigenvalues in the interval [0, 1]: 

E(7^) = {Ae M{n) I < A < 1} 

Were we wrote A^(7i) for the set of all linear operators on Ti. 

The states for this system are the functionals defined by density matrices p 
through 

A ^ Tr(pA), 

for any operator A G E(7^). The set of states therefore is associated with the set 
of density matrices 

§(H) = {p e M{n) \ p >0 a.nd Tr(p) = 1} 
We will say more about the meaning of the mathematical structures which exist 



in the sets E(7^) and §(7i) in Section 2.2 For now we will assume that they are just 



given. The purpose of this section is to show that the rest of quantum mechanics 
mostly follows from interpreting these mathematical objects as propositions and 
states. 

This way of looking at quantum mechanics, although it has a venerable history 
|34] . is not widely known. Traditionally one sees an observable as a self-adjoint 
operator X and postulate that its spectral decomposition X = Yll^=i ^i^i ^lust be 
used to compute the probabilities Tr(pPj) for each outcome i = 1, . . . ,n. Here, Pj 
is the projector on the eigenspace of X with eigenvalue Xj. Clearly, the projectors 
Pi are effects, as defined above, and the probabilities are obtained by the expected 
formula Tr(pPj). More generally one can introduce the concept of a Positive Op- 
erator Valued Measure, or POVM, which generalizes this traditional concept of 
observable. Indeed, if we let a quantum system interact with an auxiliary system, 
and then measure a "traditional" observable on that auxiliary system, we obtain 
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probabilities whose dependence on the state p of the system is given by the formula 

Tr(pAj) where the operators Aj need not be projectors. Indeed, suppose that the 
initial state of the auxiliary system is {ip), and that the common evolution of the two 
systems, until we perform the measurement, is given by the unitary time-evolution 
operator U. If the eigen-projectors of the observable measured on the auxiliary 
system are Pj, and if the initial state of the system is p, then the corresponding 
probabilities are given by Pi = Tr(f/(p {ip) (ipDW {1 ® Pi)), where the first tensor 
factor corresponds to the system and the second to the auxiliary system. Using the 
cyclicity of the trace, we easily see that this can be written as pi = Tr(p/li) where 

A := (1 {^\){U^{1 ® Pi)U){l IV'))- 

In this notation, {ipl is viewed as the linear operator mapping a state |0) to the 
number {ip\(f)). This operator, which in finite-dimension can be seen as a matrix of 
just one row, can be legally tensored with other operators. For instance, A (8) 
maps a state |0) 10') of the two systems into a state of the first system, namely 
A\(j)) {ilj\(f)') . The ket {ip) is the adjoint of (ipl (a matrix of just one column). It maps 
a complex number z to the ket z\ijj). Hence, for instance, 1 ® maps a state 
10) of the system to the composite state |0) ® \ip)- To see this, we interpret 10) as 
10) <S> 1, where 1 is the complex number one. The point is that |0) (g) 2; = 2;|0) (8) 1 
for any z, which means that tensoring with a complex number does not add any 
more information: it can be done freely, and simply corresponds to a different way 
of representing the same Hilbert space. 

In general these operators Ai representing the discrete POVM must only be 
self-adjoint, positive (i.e. with positive eigenvalues), and such that A = 1. 
This normalization condition in fact implies that the eigenvalues of Ai must also be 
smaller than 1, which is what we need to make sure that Tr{pAi) is a probability 
between and 1. These operators Ai are called POVM elements, or effects, and 
are indeed the operators in our set E(7i). Effects P G E(?i) which are projectors: 
P^ = P will be called sharp. Effects which are not sharp will be called unsharp. 

Instead of referring to the standard notion of observable and POVM, let us 
derive these concepts from scratch starting with the axioms that propositions are 
given by the set E(7Y) and states by functionals of the form A 1— > Tr(pA). This 
exercise will allow us to introduce the point of view that an observable is essentially 
a communication channel from a quantum to a classical system. 

Before we can define precisely what we mean by that, we first need to explain 
how to model a classical system also in terms of effects and states. 

2.1.1 Classical models 

A classical model is always associated with a phase-space Vt. For the moment we 
will ignore any mathematical structure that one may put on a phase-space, like 
the canonical Poisson structure of mechanical systems. Indeed, we will consider 
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contexts where Q would not have any particular structure, for instance in the 

case where it represents the internal state of a computer, and is therefore discrete. 
Note that it is not usual to call Q a phase-space when it represents a discrete 
system, however it should be clear that even in that context it plays the same role 
as that of a mechanical phase-space. Specifically, any element x & specifies a 
complete knowledge about the classical system, a concept which can be understood 
intuitively, but which will be made precise once we say what the effects and states 
are. 

Let us assume for simplicity that f2 is a finite set. The set of propositions E(f2) 
will be the set of all functions o; : f2 — > [0, 1] . These functions can be seen simply 
as vectors with components < ctj < 1 indexed hy i e Q. The interpretation is 
the following: if the system is in "state" i e il, then the proposition a is true with 
probability ctj. In particular, if the components are either or 1 then it selects 
a subset uj C Q containing only those "states" i E Q which make the proposition 
a true. This corresponds to the standard notion of a logical proposition. As in 
the quantum case, these special propositions will be called sharp. The constant 
function with value 1 will be denoted by 1 since it plays the same role as the 
identity operator in the quantum case. Indeed it is the proposition which is always 
true. 

In the quantum case, we referred to the matrix algebra (7i) which is the linear 
span of the set of propositions E(7i). Since we will want to consider maps between 
classical and quantum propositions, it is convenient to consider the equivalent of 
M.{T-C) in the classical case, namely the complex hnear span of E(f2). It consists 
simply of all complex functions on ft and will be denoted by L(Q) for the case where 
Q is finite. L{Q) has more in common with A^(7i) than the fact of being a complex 
vector space: it is also an algebra in terms of the standard product of functions. 
The most important difference with the set of operators A4{7i) is that the product 
on L(Q) is commutative. Using this product we can see that what we called classical 
sharp propositions are also the only functions a which satisfy — a, just like in 
the quantum case. One can also introduce the concept of an adjoint. The adjoint 
/* of a function / is simply its complex conjugate /, so that the real functions 
are also the self-adjoint functions. For instance, the components of an effect a 
satisfy {a*)i = a.i — ctj. We see that the range of a function a e Lifl) plays the 
same role as the set of eigenvalues of an operator A e M.{T-L). Indeed the classical 
propositions are the self-adjoint, i.e. real, functions with range inside [0,1]. In 
fact this algebra structure together with the operation of taking the adjoint ("star" 
operation ), makes LiVt) into the mathematical structure called a C*-algebra, which 
captures precisely that which is common between quantum and classical systems. 
Indeed M. (H) is a C*-algebra as well. We will see that the systems represented by 
other types of C*-algebras are in a sense intermediate between being quantum and 
classical systems, and will come up naturally once we consider the many ways that 
quantum information can be degraded by noise. 

Classical states are of course represented by probability distributions on Q, i.e. 
functions /i : — > [0, 1] which are normalized by the condition 'Ylii&nl^i ~ ^- 
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A probability distribution /i represents the functional on proposition defined by 
a (— > Just as in the quantum case, the states can be represented by 

positive elements of the algebra L{Q): 

S{Q) = {ne L{Q) I /i > and ^f^i = !}■ 

In fact, a classical system can also be represented as a quantum system with 
some constraints on the states that it can take. Indeed, if Q contains n elements, 
the algebra L{Q) can be viewed as the set of all diagonal complex n-hj-n matrices. 
The algebra operations, including the star operation (adjoint), are then exactly 
the quantum ones. In this representation, using the bra-ket notation of quantum 
mechanics, we can write classical propositions as a = Xlien This is in fact 
the way that one often deal with classical systems in quantum information theory 
(for instance this is what is done in This notation has the advantage of 

directly allowing one to deal with the interaction between coexistent quantum and 
classical systems while working entirely within the formalism of quantum theory. 

2.1.2 Observables as quantum-to-classical maps 

In general we assume that we know how to observe a classical system. This means 
that if a classical system characterized by the phase-space Q is given, it is assumed 
by default that we are also given an interpretational framework which tells us what 
the propositions E(f2) actually mean in terms of possible experimental devices. 
Said differently, we assume that we would know how to conceive experiments which 
would test the propositions represented by the effect in K{Q). 

In addition, it seems reasonable to say that we only directly observe classical 
objects, i.e. objects which are understood and modeled in classical terms, rather 
than quantum objects. This means that if we want to make any kind of observation 
of a quantum system, we need to somehow couple it to a measurement apparatus, 
a part of which is a classical system that we directly observe. This classical part 
of the measurement apparatus which is pertinent in giving us the outcome of the 
experiment will be called the pointer. Hence we assume that we can answer any 
question (i.e. proposition) about the state of the measurement apparatus' classical 
pointer, and that the answers to these questions somehow convey information about 
propositions of the quantum systems. Let Q be the phase space for the classical 
pointer, and Ti. the Hilbert space of the quantum system under observation. We will 
not need to consider the details of how these two systems might interact. Instead 
we consider the most general way that propositions about Q can be translated to 
propositions about Ti. The translation "table" must be given by a map 

X* : E(fi) ^ K{n). 

The reason for the star will be explained later. This is indeed how we will represent 
an observable on Ti, with "spectrum" Q (i.e. the set of values it can take). However 
we need some additional constraints on this map. 
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Suppose that we have some knowledge about the quantum system represented 
by a state p e §(7i). This knowledge should allow us to make some prediction 
about the classical system, i.e. it should translate into some knowledge about 
the measurement apparatus' pointer, represented by a classical state /i G S(f2). 
Mathematically, this is indeed what we get by using X* to "pull back" the states of 
the quantum system, which are functionals on E(7i). Remember that the density 
p represents the functional which sends A to Tr(pA), for any A G E(7i). Since a 
classical proposition a is mapped to the quantum proposition X*{a), we indirectly 
get the function p : a ^ Tr(p(X*(Q;)) on classical observables. However, for this 
functional to be a valid classical state, it has to satisfy certain conditions. In 
particular, classical states are linear functionals. This means that for any two 
effects a, /3 G E(Q), and complex numbers x,y, we must have 

Tr{p{X*{xa + y(5)) = xTv{p{X*{a)) + yTV(p(X*(/3)) 
= TV(p(xX*(a)+yX*(/5))). 

Since this equation must be true for all quantum states p, it implies the operator 
equation 

X*{xa + yl3) = xX*{a) + yX*{(3) 

Hence X* itself must be linear. Because of this property it extends uniquely to a 
linear map 

X* : L{Q)-^M{l-i). 

We note this fact because it is convenient to work with linear spaces. However X* 
cannot be any linear map, it must also send E(r2) C L(f2) entirely into E(7i) C 
Ai{T~C). This is exactly guaranteed if we require that it be positive, i.e. that it sends 
positive functions on to positive operators on 7i, and that in addition X*{1) < 1, 
i.e. that the eigenvalues of X*(l) be all smaller than 1. However, we must also 
require that it sends quantum states to properly normalized functionals on Vl, i.e. 
Tr(pX*(l)) = 1 = Tr(pl) for all p, which implies 

X*(l) = 1. 

We summarize this condition by saying that X* is unital . Hence, the observables 
must be represented by linear, positive and unital maps X* . These conditions are 
all we need on X* to make sure that it induces a valid map from quantum state to 
classical states, which we will write as X. Consequently, the map X is also linear: it 
is indeed the linear dual of X* . Since we arc working with finite-dimensional vector 
spaces, X* is also the dual of X, which justifies the star. The infinite-dimensional 
case will eventually justify our choice of putting the star on the map which acts on 
propositions, rather than on that map which acts on states. 

Summarizing, the dual map 

X : ^{K) ^{n) 

which pulls quantum states back to classical states, is the restriction to S(0) C L(0) 
of the linear map 

X : M{H) L{n). 
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The map X* could be specified simply as the dual of X. Just as X* needs to be 

positive, X also must be positive. Indeed it must send quantum states, which are 
positive operators, to classical states which are positive functions. The condition 
X*(l) = 1 however takes a different form in terms of X. Recall that we arrived 
at this equation by requiring that X preserves the normalization of states, i.e. if 
fx^X{p) then Eif^i-T^ip)- 

Let us now give a simple representation for the linear maps from L(Q) to A^(7i). 
Since they are linear, they can be characterized by their action on a basis of L{Q). 
We will use the canonical basis formed by the functions Xi{j) •= ^ij- The action of 
X* on any classical proposition a is, by linearity, X*[a) = Yli '^i-^* iXi) ■ Hence X* 
is entirely specified by the family of effects Xi :— X*{xi)- The condition X*{1) — 1 
reads '^^Xi — X*{1) — 1. Reciprocally, any family of propositions Xi such that 

Xi — 1 defines an observable through 

X*{a) :=^a.X,. 

i 

The dual X then sends the quantum state p to the classical state /i which maps a 
proposition a to the probability Ylii'^il^i — '^{p^*{'^)) — 'Ylii'^i^iP^i)- Hence X 
sends p to the probability distribution p, defined by 

Pi^T^ipXi). 

The unitality condition X*{1) = 1, which is equivalent to ^j-^i = 1, is also 
equivalent to J^iP'i — Tr(p)- Let us define the classical trace Tr{p) :— YliiP'i- 
Note that it is indeed the trace of the diagonal matrix X^i which represents 

the classical state as a quantum state. Hence we can summarize the condition 
5^- Pi = Tr(p) by saying that X is trace-preserving. 

This shows, at least in finite dimension and for a finite number of outcomes, 
that this notion of observable is equivalent to that of Positive Operator- Valued 
measure (POVM). Indeed, a POVM which represents an observable with only a 
finite number of outcomes belonging to the set can be defined as a family of 
propositions Ai G Ai(TC) such that ^jg^ A = 1- The interpretation associated 
with it is that, if p is the state of the system, then the outcome labelled by i is true 
with probability Tr^pAi), which is precisely what we obtained for our concept of 
observable. 

Note that effects are one-to-one with a particular class of observables, namely 
observables with two values, i.e. defined for Q — {0, 1}. Indeed, given any effect 
A e E{n), one can define the observable X : L({0, 1}) E{n) by 

Xo = X*{xo) = A and Xi = X*{xi) = 1 - A. 

This gives a particular interpretation for the effect 1 — A: it is the alternative to A 
in an observation of A. 

Let us also show how we can recover the more conservative notion of observable, 
namely what we will call sharp observables. A sharp observable X is an observable 



18 



Quantum theory 



which maps sharp effects to sharp effects, i.e. such that for all sharp classical effect 
Xuj (the characteristic function for a subset C fi), the image X*{xuj) is a projector, 
i.e. {X*{xuj))^ = X*{xuj)- In fact this is equivalent to requiring that Xf = Xi for all 
i e n. Indeed, if Xf = Xi, then Xi-Xf = {l-Xi)Xi = 0. But 1-Xi = J^j^i Xj. 
Hence also J^j^^i ^i^j^i = 0- Since this is a sum of positive operators, this implies 
that XiXjXi = {XiXj)*{XiXj) = for all j, which in turn implies XjXi = for 
all i and j. From the orthogonality of the effects Xi it is easy to see than any sum 
X*{Xi>j) = J2iei^-^^ ^ projector as well. 

In fact, this also proves that these projectors are automatically orthogonal. 
Since also they are complete in the sense that Yli-^i ~ 1' then they can be seen 
as the projectors on the eigenspaces of a self-adjoint operator A = ^i^Q KXi, 
which is indeed the standard notion of an observable. Here Xi can be any family of 
distinct real numbers. In fact, to be pedantic, we could make sure that C M and 
write A = X^Aen"^"^*' However the actual eigenvalues matter little for a discrete 
observable. 

2.1.3 Capacity of an observable 

Intuitively, measuring a sharp observable should yield more information about the 
system than measuring an unsharp one, provided that they have the same number 
of outcomes. Viewing observables as communication channels yields a natural way 
to make this concept precise, namely the information capacity of an observable. 

The idea is the following. Suppose that Alice can prepare a given quantum 
system in any state she wants, and Bob is going to measure the observable X of 
this system. If the two can agree of a protocol beforehand, how many bits can 
Alice communicate in this way to Bob? In order to make this more precise, one 
must consider that they will actually use an arbitrary large number of systems, 
and see how many bits Alice can transmit per system, in average, as the number 
of systems goes to infinity. This is a direct parallel to the definition of capacity 
for a classical channel. In fact, it can be reduced to it, as shown by the Holevo- 
Schumacher- Westmorland (HSW) theorem |35j . 

Indeed, the process by which Alice will encode her classical bits into a quantum 
state is itself a channel, in this case a classical-to-quantum channel, also called a 
state-preparation procedure. Let us call such a channel 0. Since is linear and 
acts on L{Q), it can be characterized by its action on the basis states Xi- L^t us 
write Pi = (l){xi) ior the quantum states which are image of these basis states. For 
a general classical state /i = J2if^iXiy we have 




If we compose this classical-to-quantum channel with the quantum-to-classical 
channel X, we obtain simply a classical channel Xocp, for which a Shannon capacity 
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C{Xo(j)) can be computed. The HSW theorem then states that the classical capacity 
of the channel X is given by 

C{X) = supC(Xo0). 

Since the HSW theorem is usually expressed for general quantum channel, this 
formulation is not usual. Let us therefore show that this is indeed equivalent to the 
formula given in , namely 

C{X) = SUp(S{X{J2^^^P^) - $^/i.^(X(p,))) (2.1) 

Where, by S{fi), we mean the von-Neumann entropy of the quantum state whose 
eigenvalues are given by the classical distribution /i, which is simply the Shannon 
entropy of p: S{fi) = H{fi). 

Note that C{X o 0) is the maximum over all initial states fi of the mutual 
information for the joint probability p{i, j) of getting outcome Xj and input Xi- This 
joint probability is given by p{j\i)fii, where p{j\i) = X{(f){xi))j is the probability to 
get outcome Xj given that the input was Xi- Therefore p{i,j) = X{(j){xi))jfJ'i and 
has marginals ^i(f'iXi))j = /^i and '^iX{(f){xi))jfJ.i = X(0(/i))j. We therefore 
have 

CiX o 0) = sup[/f(X(0(/i))) + H{^i) - Hip)]. 



But note that 

H{fi) - H{p) = - J^/iilog2(/i») + ^p{iJ)^og2{p{iJ)) 



= -^^^i[^og2i^ii) - ^pi3\i)\og2ipii,j)) 

i j 

= '^f^i^PUli) log2(p(j|0) 

i j 

= -J2^^^H{x{4>{x^))) 

i 

and so, 

C{X) = sup C{X o 0) = sup[/7(X(0(/i))) - J2 ^^^H{X{<P{x^)))] 
= sup (s{xC^fXiPi)) - ^^iH{X{pi)) 



which is indeed equal to Equation 2.1 



When applied to a discrete sharp observable, this quantity is equal to the loga- 
rithm of the number of outcomes. Indeed, if X is sharp, then the optimal strategy 
for Alice consists in encoding her classical states into eigenstates of X. The number 
of states that she can encode in this way is the number of distinct eigenstates n of 
X. This corresponds to log2(n) bits. 
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2.1 A Quantum channels 

We want to consider the most general way that a quantum system B can encode 
information about another quantum system A, as specified by a map 

£* : EiHs) ^ H^a) 

which translates each proposition on system B into a proposition of system A. All 
the points made in the discussion of observables are still valid here, even though 
we have replaced the classical system by the quantum system B. Indeed the only 
algebraic properties of states and effects that we have used in our presentation 
involved the linearity and the positivity of effects and states, which form the same 
algebraic structure in the quantum and in the classical case (more will be said about 



what this structure is in Section 2.3). Hence the most general such map £*, which 
is also required to induce a valid map from states of system A to states of system 
B, must extend to a unital positive linear map 

s* ■.M{nB)^M{nA). 

This map is the dual of a trace-preserving positive linear map 

S : MiHA) ^ MiHB) 

which maps states of A to states of B. The relation between the two can be simply 
expressed from the fact that they yield the same probabilities: 

Tt{S{p)A) = Tt{pS*{A)) 

for all states p and all effects A. 

However we need another important condition which happens to be implicitly 
satisfied in the case of the quantum-to-classical maps which define observables. We 
need to make sure that if our systems A and B were only parts of larger systems, 
then our map could be trivially extended to those larger systems. This means 
that, if we add a third system C with Hilbert space TCc, then the map £* (S> id from 
M{nA®'Hc) to M{nB®nc) defined by {£* ®id){X®Y) := £*{X)®Y, should also 
be unital, positive and linear. Unitality and linearity come for free. However the 
positivity must be postulated. If S* ® id is positive no matter what the dimension 
of the auxiliary system C is, then we say that S* is completely positive. In this case, 
the dual £^(8>id is also automatically completely positive. This condition is not only 
physically reasonable, but also mathematically extremely useful since it allows one 
to apply a powerful representation theorem by Choi [36] , which states that there is 
a family of operators (the channel's elements) such that 

k 

from which it is easy to see that also 

£ip) = J2EkpEl. 
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In fact any map of this form is completely positive (and of course linear). Unitality 
of S* requires in addition that 

k 

Let us state and prove Choi's theorem for completeness. 

Theorem 2.1. A linear m,ap S from Ai{T-CA) to M.{l-iB) is completely positive 
if and only if it has the form £{p) = "^kEkpE^. for a family of linear operators 
Ek ■ Ha Hb ■ 

Proof. First, it is clear that a map defined by £{p) — E^pEI is positive. Indeed, 
if p is positive then EkpE^ is a sum of positive operators, which is also positive. 
In addition, its extension {£ ® id)(p) = Ylik^^k ® ^)p{Ek ® 1)* has the same form 
and is therefore positive too, which show that £ is completely positive. For the 
converse, suppose that £ is completely positive. Let us view £ as going from the 
complement of its kernel ker(£')-'- to its range ran(£^). Since these subspaces have 
equal dimensions, they can be both identified with the same space Ti.. Consider 
a basis \i) of H. Consider also the Hilbert space 7i ® 7i and the operator A :— 
® acting on it. Note that A is positive. Indeed for any state 

1^) :— l'^) (g) 10) with components (^|''/')OI0)) we have 



Since £ is completely positive, {£ (g> id) (A) is a positive operator on Ti^ ® Ti-B- 
Therefore it has a complete set of orthogonal eigenvectors 1^^) € Ha ^Hb- This 
means that it can be written as 

{£®id){A) = Y^\vk){vk\ 

k 

where the norm of the vector \vi) is the square of the corresponding eigenvalue. 
Explicitly this means that 

ij k 

by multiplying by 1 (8) {j\ on the left and 1 (8) |i) on the right we get 

m){J\) = J2(^®{J\)\vk){vk\{l®\^)) 

k 

which is equivalent to 

^(K)(j|) = E(^^l(IO(^l®K)01)k) 

Ik 

= E(^'^l(IO(^l®l001)k) 

Ik 

^Y.i^,m^i)\i){j\{{i\0i)\vk) 

Ik 
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But since form a basis of all operators on H, we have by linearity 

Ik 

for any operator p on 7i. This is the representation that we seeked, with elements 

Em := KKIO®!)- 

□ 

Such a map between states, namely a trace-preserving completely-positive linear 
map, will be called a quantum channel, or simply a channel. In order to avoid 
confusion, we will only call a channel the map £ on states. £* will simply be called 
its dual. We may also call an observable X a quantum-to-classical channel since it 
sends quantum states to classical states. 

2.1.5 Transformation of observables: the Heisenberg pic- 
ture 

All observables X are basically made out of effects, namely those in the image 
of X*: X*{a) G E(7Y). For convenience, we will refer to them as the effects of 
X. Since the dual of a channel maps effects to effects, it also maps observables to 
observables. Indeed, consider the observable Y* defined by Y*{a) := £*{X*{a)) for 
all classical effects a. To see that Y* is indeed an observable, simply note that it 
is unital, positive and linear, because both £* and X* are. This equation can also 
be written as 

Y* =e*oX* or Y ^XoS 
or simply in terms of the elements Yi = Y*{xi) and X^ = X*{xi) 

Yi^S*{Xi) 

An alternative way of writing this equation, which is intuitively helpful, is as a 
commutative diagram showing the flow of information between states of the three 
systems involved, namely the source quantum system A, the target quantum system 
B, and the classical pointer fl. 

M{Ha)^-M{Hb) 

Y 

L{Q) 

Each object can be seen as a physical system, be it quantum or classical, and each 
arrow as a transfer of information between systems. An arrow from a quantum to 
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a classical system corresponds to an observation. We will see later that this can be 
formally seen as a diagram in the category where the objects are the pre-dual of 
von Neumann algebras and the morphisms the trace-preserving completely-positive 
linear maps between them. The correct way to read such a diagram is as a set of 
equations stating that each possible path between two objects are equal. 

This is how the channel S acts on observables. In the case where £ represents 
some time evolution for a certain interval of time, this is simply the Heisenberg 
picture. Indeed, in the case of a unitary evolution Ut, remember that we get the 
Heisenberg picture by noting that what matters are expectation values of the form 
{il){t)\A\ilj(t)) for a self-adjoint operator A representing a sharp observable, and for 
the time-dependant state \ip{t)) = Ut\ip). These expectation values can alterna- 
tively be written as {ipp; AUt\ij) = (^l^(^)l^), where we defined A{t) := U^AUt. 
Hence we can either assume that our knowledge represented by the states evolves 
as a function of time (the Schrodinger picture), or alternatively assume that our 
knowledge is fixed, but that it is our interpretation of what the observables are 
which changes with time: a point of view called the Heisenberg picture. A channel 
£ specifies a more general type of evolution, however the scheme is the same. We 
took the more general expectations values (probabihties) of the form Ti{pA) as the 
fundamental predictions of the theory, and noted that a given map £* on effects also 
induces a map £ on states through the relation Tt{£{p)A) = Tt{p£*{A)). Hence 
the map £*, which says how effects evolve in the case of a fixed state, specifies 
the evolution in the Heisenberg picture. However, writing an observable X as a 
channel, rather than in terms of its elementary effect Xi, we can also write the 
Heisenberg evolution using the "Schrodinger" channel £, as X t-^Y — X o £. 

However there is something that this more general picture renders apparent 
which is normally overlooked in the unitary case. It should be clear by now that 
if the channel maps states of a system A to states of another system B, then the 
effect of the channel in the dual Heisenberg picture is to map observables of B to 
observable of A, i.e. it goes "backward" . This can also be seen by looking at what 
happens if we compose two channels £ and £'. We get the new channel £' o £. 
But in the Heisenberg picture the action is given by {£' o £)* = £* o [£')*. Hence 
in the Heisenberg picture we have to apply the latest channel first. If the channel 
represents a time evolution, one way to interpret this is to think of the observable 
Y — X o £ as that which, if measured at the initial time, would yield the same 
result as X measured at the later time. 

2.1.6 Measurements 

We have argued that the most general way that a classical system with phase-space 
Q can hold information about a quantum system defined by the Hilbert space 7i, is 
specified by a trace-preserving linear map X : Ai{T-C) L{Q). But if we actually 
perform this measurement, we may expect that the quantum system that we have 
measured still exists afterward. This means that the process of measurement should 
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map the state of our system not only to the state of our measurement apparatus' 
pointer, but also to the later state of the system itself as well. Hence it should be 
a channel 

£ : Min) ^ Min) ^ L{n) 

with dual 

£* : Min) ^ L{n) ^ Min) 

where H is the Hilbert space of our quantum system and Q the phase-space of the 
measurement apparatus' pointer. 

In order to see what the tensor product between a quantum and a classical 
system is, consider the representation of classical states and classical effects as di- 
agonal quantum states and diagonal quantum effects. For instance, the classical 
effect a can be represented as a = ^jg^ tensor product with a quan- 
tum effect A then is simply a = Ylien'^i^ ® This shows that in this 
representation, the space M.{l-C) ® L{Q) consists of block-diagonal matrices. It is 
in fact a subalgebra of the algebra of full matrices. It defines in its own right a 
physical system which is not quantum nor classical, but hybrid quantum-classical. 



We will see more general structures of this type in Section |2.3| where they will be 
formalized as C*-algebras. 

For the process represented by S to represents the measurement of a specific 
observable X we require that 

S*{l^a) = X*{a) (2.2) 

for any classical effect a G E(r2). Indeed, l®a is an effect on the joint system which 
simply ignores the quantum system, since 1 is true no matter what the state of the 
quantum system is. Note that the dual of the completely positive map a i-^ 1 (8> a is 
the partial trace over the quantum system. Indeed, given a basis \i) of the quantum 
system, we can write this channel explicitly in terms of its elements: 

a^^\i){i\®a = ® 1) a ® 1). 

i i 

We therefore immediately obtain the dual map on states by swapping the elements: 
p (S) /i t-^ ''^{{A ® l)(p ® ® 1) = ^(^IpK) P = Tr(p)p = p. 



Equation 2.2 implies that £*{1 ® Xi) = ^oi all i G fl. Note that the maps JF* 
defined by 

J^iA) ■.= £*iA^x^) 

from Ai(TC) to itself are completely positive, although not unital. These maps can 
be anything apart from the requirement given by Equation 2.2, which is 
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Hence the most general form of a measurement of the observable X is as a map 

i 

for some completely positive maps J-'* which satisfy ^/(l) = Xi. The dual S is 
given by 

i 

If we represent the classical system as a diagonal quantum system, we could write 
this as S{p) = ^i^i{p) <S>\i){i\. 

Hence, if our knowledge of the system was represented by p, then after it has 
interacted with the measurement device, our joint knowledge of the system and the 
pointer is represented by the state ^i{p) ® Xi- order to see what this implies, 
let us consider a second measurement after this one. 



2.1.7 Repeated measurements and collapse 

Suppose that we perform a measurement of X as specified above, followed by a 
measurement of a second observable X' associated with the phase-space Q'. By 
composing the measurement S with the channel X' applied on the final state of the 
quantum system, we obtain the purely classical state p of the two pointers: 

i ij 

This is simply the functional way of writing the joint probability distribution 

Pij = Tr{J^i{p)X'^) 

where i E Q is the outcome of the first measurement and j £ Q' the outcome of the 
second. From this joint probability distribution we can compute the conditional 
probability p{j\i) of obtaining outcome j G Q' in the second measurement, knowing 
that the outcome of the first measurement was i e fi: 

ji^^TTi^^ip}x;^ 

= Tr(AX;) 

where we have introduced the quantum states 

Hp) 

Tr{J-^ip))' 



Since X' is not involved in the definition of pi, we see that after the measurement 
of X, and the finding of the measurement apparatus' pointer in state i E Q, our 
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knowledge about the quantum system can be updated to the new state p^, which 
has the usual form given by the collapse postulate. 

For instance, if the observable X is sharp, in the sense that all the effects Xj = Pj 
are sharp: = Pi, then one possible choice of measurement apparatus is given by 

J^iip) := P,pP, 

This indeed defines a measurement of X, since ^*(1) = Pf = Pi- We then obtain 
the usual state update formula 

^ P.pPi 
Tr(P,p) • 

2.2 Effect algebras 

This section is meant to give some logical motivations for the mathematical struc- 
tures that we postulated on both the quantum and classical sets of propositions. 
It is interesting to note that a physical transformation (channel) is not required to 
completely preserve all of these structures. For instance, the product of effects need 
not be preserved. The main algebraic structure which is preserved is the addition 
of effects. We introduce here one program of axiomatization of quantum mechanics 
which is based on this structure |37] . 

Let us suppose that the set E represents the propositions associated with some 
physical system. We assume that we also have a set of states § on it, which assign 
a probability to each proposition. We want to define on E a generalization of the 
logical "or" between disjoint propositions. We write 7 = a+jS for three propositions 
a, P and 7 whenever p(7) = p(a) + p(/3) for all states p. Addition of probabilities 
is indeed what we expect for the logical disjunction of two propositions. In general, 
the sum a+P cannot be defined for all pairs of propositions (3 and a since p(a)+p(/9) 
might be greater than 1. Pairs of propositions for which the sum exists will be said 
to be valid alternatives. The sum a + P, when it exists, can always be assumed to 
be unique, given that the corresponding probabilities are defined uniquely by the 
relation p(a + P) = p{a) + p{P) for any state p. 

One property that we get for free from this definition is that a + P = P + a. 
Many interesting results can be obtained with this structure if we postulate a few 
useful axioms which make E into an effect algebra [37]. In order to get a sort of 
associativity, we assume that if a + P and 7 + (a + /?) are defined, then (7 + a) + 
is defined too. From the definition of the addition as it relates to states, we get 
{■y + a)+P = 'j + {a + P). Next we postulate that there are two special propositions: 
the proposition which is always false, i.e. p(0) = for any state p, and the 
proposition 1 which is always true, i.e. p(l) = 1 for any state p. We then assume 
that for any proposition a, there is a proposition P such that a + P = 1. P can be 
written as /3 = 1 — a. It can be interpreted as the negation of a. Indeed for any state 
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p such that p{a) = 1, we have p(l — a) + p{a) = p(l) = 1. Hence p(l — a) + 1 = 1, 
which imphes p{l — a) = 0. This shows that the complement 1 — a is false whenever 
a is true. Conversely we also have that 1 — a is true whenever a is false, which is 
what we expect of the negation of a. 

The last axiom needed to define an effect algebra is that 1 + a exists only for 
a = 0. In addition, we assume for the moment that there are no other restrictions 
on states other than those implied in the definition of +, and 1. This means that 
the set of states consists of all the functions p from E to the interval [0, 1] which 
are such that p(a) + p{/3) = p{a + (3), p(0) = and p(l) = 1. 

An interesting property of an effect algebra is that it automatically comes with 
a partial order defined hj a < (3 whenever 7 + a = /? for some proposition 7. This 
relation can be understood as a form of logical implication from a to (3. Indeed 
for any state p such that p(a) = 1, i.e. any state for which a is true, we have 
< piP - a) = p{l3) - p{a) = p{l3) - 1. Hence 1 > p{j3) > 1, which implies 
p(/?) = 1, i.e. P is true. Hence a implies /?. 

2.2.1 Linear effect algebras 

The effects algebras of quantum and classical systems have an additional simple 
structure in common which makes the above axioms more concrete and easier to 
understand. 

Suppose that we are given an ordered vector space V with a special element 
e > 0. An ordered vector space is a vector space with a partial order < satisfying 
M < f if and only if < f — m, and < Xv for all < f and A G M+. Then an effect 
algebra is automatically formed by those elements of V which are larger than the 
zero vector and smaller than e: 

E{V) ■.= {aEV\0<a<e}. 

This set E(^) is an effect algebra for the partially-defined addition defined by the 
addition of vectors, with the false proposition given by the zero vector and the true 
proposition given by the specially chosen vector e. The natural partial order of 
this effect algebra is of course the one which we started from. Hence what we have 
done was to use the partial order to tell which vectors can be added. Indeed we 
have that a + (3 exists exactly when a + f3 < 1. Such a structure is called a linear 
effect algebra |^38j. Note that since we are only interested in the vector space in the 
extent that it defines the effect algebra, we will always assume that V is the span 
of the effect algebra E{V). 

Note that the states are automatically given by linear maps on V. Indeed, a state 
p is required to satisfy p(a + (3) = p{a) + p(/5) for any effects a and /?. This means 
that for any positive integer n such that na < 1, we must have p{na) = np{a). 
In addition, for any positive integer m we have a = in^a, hence p{a) = mp{^a), 
which implies p{^Oi) = ^p(a). Combining the two results, for any positive rational 
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q — ^ and effect a such that qa < l,we have p{qa) — qp{a). In addition, note that 
a < f3 imphes p{a) < p{(3). Indeed, a < (3 means that there exists < 7 < 1 such 
that q; + 7 = /3, from which we have p{a) + p(7) = p(/9), and hence, p{a) < p{l3). 
Now for any real < r < 1, consider a sequence < g„ < r G Q and a sequence 
1 > > r e Q both converging to r. We have qnP{oi) = p{qnOi) < p{ra) < 
piq'nf^) — q'nPioi) for all n. In the limit, this implies rp{a) — p{ra). 

2.2.2 Morphisms between effect algebras 

We want to consider maps between two effect algebras: 

We have seen that once we specialize to quantum and classical systems, the inter- 
esting maps are those which preserve the addition of effects, i.e. 0(q!) + is 
defined whenever a + {3 is, and 

0(a + /?) = 0(a) + 

This automatically implies that 0(0) = 0. But in addition we require that 

0(1) = 1. 

These assumptions are what we need for the map to make sense as a transfor- 
mation of states. Indeed, consider a state pA of E^. We automatically obtain a 
function pB on E^ defined by psictB) = PA(0(ttB)), i-e. 

Pa ^ Pb = Pa° 4> 

Although the function pB is always defined, it may not be a state. It will be a 
state if it satisfies Pb(q; + P) = Pb{o:) + Pb{i3) and Pb(1) = 1- The first condition 
implies that p^(0(a + (3)) = pyi(0(Q;)) + pa(0(/9)) for all pA, which implies that 
0(q;-|-/3) = (f){a)+(f){P). Prom the second condition we have 1 = Pb(1) = Pa(0(1)) = 
Pa{^) for all Pa, which implies 0(1) = 1. Maps having these properties will be 
called morphisms as they preserve all the structures of the effect algebras. They 
correspond to the most general transformations which maps states of A to states 
of B and effects of B to effects of A. 

For both quantum and classical systems, these morphisms almost define what 
the possible physical transformations are. In the quantum case we will just need 
one additional assumption which will be that must also be a morphism if the 
system of interest is only part of a larger system. However, we have not introduced 
the structure needed in order to define a subsystem in this abstract framework. In 
fact, the question of what structure one needs to add to an abstract effect algebra, 
if any, in order to define this concept, is an open research problem. 

If E(yA) and E(T/b) are two hnear effect algebras built respectively from the 
vectors spaces Va and Vb, then the morphisms are the linear maps from Vb to Va 
which are positive, and such that 0(1) = 1. The map is called positive if 0(q;) > 
whenever a > 0. This is all that is required for to preserve the order structure 
and therefore the induced effect algebra structure. 
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2.2.3 States 

Let us say a little bit more about states. Note that the real interval [0, 1] itself is 
a linear effect algebra in terms of the addition of real numbers, and the natural 
complete order of real numbers. Hence the set of states on an effect algebra E is 
simply the set of morphisms from E to [0, 1]. 

If E is a linear effect algebra spanning the vector space V then the states are 
elements of the dual space V*. For instance, when V is finite-dimensional, V* is a 
vector space of same dimension as V. Note that V* is an ordered vector space like 
V. Indeed, for an elements p E V*, we have p > if p{a) > for all a > 0. This is 
of course the kind of positivity which is required of a state p. Therefore states are 
positive elements of the dual space V*. In addition we have to require = 1. In 
order to make connection with what follows, let us define the trace of an element 
p eV* as 

Tr(p):=p(l). 
Then we can write the set of states as 

§(V) = {p e 1/* I p > and Tr(p) = 1} 

Remember that a morphism cf) between two effect algebras acts on states through 
p po(j). This map defines the linear dual (f)*{p) = po(p- The positivity of (j), which 
is defined by (f){a) > for all a > , is equivalent to p{(j){a)) = (f)*{p){a) > for all 
states p and all a > 0, which in turns means (p*{p) > for all states p, or simply 
for all positive functionals p > 0. This means that positivity of (p is equivalent to 
the positivity of 0* in terms of the dual order structure. The last defining property 
of (f) is the property 0(1) = 1. Let us see what this means in terms of the dual. 
This condition is equivalent to p(0(l)) = </)*(p)(l) = p(l) for all functionals p. This 
simply means that 0* preserves our notion of trace: Tr(0*(p)) = Tr(p) for all p. 
Therefore the action of a morphism on states takes the form of a trace-preserving 
positive linear map. 



2.2.4 Effect algebra of a C*-algebra 

Both quantum and classical sets of propositions have a number of properties in 
common beyond the fact that they are effect algebras. First, as suggested above, 
their effect algebras are both defined from a vector space, i.e. are linear effect 
algebras. Again this means that we are given a vector space V, a partial order < 
and a special element e > 0, and that the effect algebra is defined by E = {ck e 
V \ < a < e}. The additional structure that we will fix can be seen as a way of 
specifying the partial order relation on our vector space. 

We will assume that K is a complex vector space, and also has a product between 
elements which make it into an algebra, an involution a a* which makes it into 
a *-algebra, and a norm which satisfies the axioms of a C*-algebra, namely 

||q;/3|| < ||q;|| and ||q;*q;|| = 
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In addition we assume that e is the algebraic unit of V. Since it is an algebra, let 
us call our vector space A rather than V. Also we will write 1 instead of e. 

Let us now explain how the partial order is derived from the C*-algebraic struc- 
ture. We can define the spectrum a{a) of an element a E Ahj 

a{a) = {A I 1 — Aa has an inverse} 

We then say that a > if a* = a and a{a) C [0, oo). This defines a partial order 
between vectors through 

a < P when P — a > 0. 

This is the partial order which defines the effect algebra associated with our C*- 
algebra A. 

The main theorem of the theory of C*-algebra states that all C*-algebras can 
be represented as >K-subalgebras of the set B{H) of bounded operators acting on a 
Hilbert space Ti. This is what makes it an appropriate axiomatization for quantum 
mechanics. 

There is an important physical concept naturally encoded in the structure of 
concrete C*-algebras which is not present in the pure effect algebra formalism, 
namely the notion of independent subsystems. Given two C*-algebras represented 
as operators on a Hilbert space: A C B{TC) and A' C B{T-C'), one has the tensor 
product A® A' C B(H ® H') which describes the physical system composed of A 
and A'. It is the algebra spanned by all pairs of the form a ® a' where a E A and 
a' e A', with the product defined pairwise: (a ® (y'){P ® P') = (a/5) ® {a'P'). The 
algebra A exists within A^ A' in the form of the operators a ® 1 where a E A. 
Reciprocally, the elements a' G A' are represented in A^A' through the operators 
1 (g) a'. 



2.3 Infinite-dimensional quantum theory 

Let us present a possible way of extending the quantum formalism presented in 



Section |2.1| to infinite-dimensional Hilbert spaces. The formalism that we will 
present is standard, in its most general form, in algebraic quantum field theory, 
but it also reduces easily to a convenient formalism for non-relativistic quantum 
mechanics. 

A lot of subtleties associated with infinite-dimensional quantum mechanics can 
be understood as coming from the fact that an infinite- dimensional vector space 
is not isomorphic to its double-dual. This means that we have to be careful in 
choosing whether we want the states to be functionals on effects, or the effects to 
be functionals on states. 

In the following chapters, we will make essential use of sharp effects, i.e. projec- 
tors. This implies that we should work not with general C*-algebras, but instead 
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with their closure in the weak topology, which are von Neumann algebras. Indeed, 

von Neumann algebras are spanned by their projectors, which implies that many of 
their properties can be deduced from the properties of the projectors themselves. 

The Von Neumann algebras which represent typical classical systems arc how- 
ever rather large. This implies that if we were to assume that states are linear 
functionals on the effects, which span the algebra, there would be an awful lot of 
them. We will see that it is in fact more convenient to assume that the effects are 
functionals on states, i.e. that the von Neumann algebra is the dual of the span 
of the set of states. This is where another advantage of von Neumann algebras 
appears: they are always the dual of something, namely a Banach algebra (i.e. a 
C*-algebra without the axiom which states that ||ci;*q;|| = ||ci;||^). If ^ is a von 
Neumann algebra, we will write A* for its pre-dual, i.e. the Banach algebra which 
is such that (A*)* — A. 

For instance, non-relativistic quantum mechanics can be formalized in a satis- 
factory way by using for A the set of bounded operators on a separable Hilbert 
space Ti, which we write as ^ = B{l-C). Its pre-dual can be identified with the 
Banach algebra of trace-class operators A^ = Bt{7i). The trace-class operators are 
the elements A e B{T-C) for which the expression 

^(i|VA^4|i) 

i 

converges for any basis For self-adjoint elements p e Bt{7i), this defines the 

trace 

i 

In fact, the product of an element of p e BtiH) with any operator A G B{T-[) is 
also trace-class, which implies that we can define Tr(pA). This is how Bt{Ti.) is 
identified with {B{7i))*. Indeed, p G Bt{H) defines the functional 

A ^ Ti{pA). 

A general von Neumann algebra is equipped with a weak-* topology, which 
amounts to defining the convergence of a sequence An G B{H) in terms of expec- 
tation values. I.e. we say that An converges to ^ as n ^ oo if p{An) converges to 
p{A) for all states p E A^,, which makes some physical sense. In the case of B{Ti.), 
this means that the sequence converges to A if the numbers TY[pAn) converge to 
Tr(pA) as n — ^ oo for all states p G Bt{7i). 

This implies that the states represented by elements of Bt{7i), seen as linear 
functionals of effects, are continuous with respect to the weak-* topology. In- 
deed, if the sequence {^n}^i converges to A in this topology, then, by definition 
Tr(pAn) Tt(pA). Hence the map A i— Tr{pA) is continuous. Conversely, those 
are all the weak-* continuous positive linear functionals. Therefore, our choice of 
states corresponds to restricting the natural set of all linear functionals on effects 
to only those which are weak-* continuous, or normal for short. 
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Note that a von Neumann algebra, when seen as a subalgebra of B{H), also has 

another weak topology, which is simply called the weak topology. It corresponds 
to defining the limit of operators only in terms of pure states, i.e, we say that An 
converges weakly to A if {ip\An\ip) converges to ('i/^IAI'^) for all \ip) G 7i. Hence 
convergence in the weak-* topology is a stronger condition. 

2.3.1 Classical, quantum and hybrid systems 

A classical system will be assumed to be specified by a commutative von Neu- 
mann algebra, which can always be represented as the algebra of (almost 
everywhere) bounded functions defined on a measure space Q. This set Q can be 
interpreted as the phase- space. The fact that f2 is a measure space means that 
functions on it can be integrated. The pre-dual, which contains the states, is then 
identified with the algebra L^{fl) of absolutely integrable functions on fl. These 
are the functions fi which are such that their absolute value x i-^ |/^ ('^)| has a finite 
integral over Q. The state represented by the positive function /j, e L^{Q) is then 
simply given, for a G L°°{il), by 



Hence the states are standard continuous probability distributions. We see that this 
formalism only handles probability measures on ft which are absolutely continuous 
with respect to the default measure on Q. If we need a more exotic classical state, 
we may need to take a different Q. For instance if we choose = M with the 
standard Lebesgue measure, we will not have any discrete probability measure (i.e. 
no delta function). However, we can include them by using instead the disjoint 
union Q = R U N. 

A general von Neumann algebra A has a center, denoted by Z{A), which is 
the commutative subalgebra containing the elements of A which commute with all 
other elements of A. In general, the center of an algebra can be understood as the 
"classical" part of the algebra, since it characterizes observables which commute 
with everything else, and are therefore not subject to any uncertainty relation. In 
the case of a commutative algebra, this center is obviously the whole algebra. 

From this point of view, it makes sense to define a "pure" quantum system 
as being defined by an algebra whose center is trivial. The center is trivial if it 

contains only elements of the form zl, z & C Such an algebra is called a factor. 
There exists three main types of factor. Most of the time we will assume that we 
are working with BiTi) which is the most general form of a factor of type I. We note 
that factors of type II are rarely used in physics, and factors of type III appear in 




The normalization of the state // is therefore given by 
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algebraic quantum field theory. Remember however that all von Neumann algebras 
can be seen as subalgebras of some B{TC). 

Note that this formalism, with the physical interpretation based on states and 
effects, works fine even if the von Neumann algebra A is not a factor, nor commuta- 
tive. In fact we will see that these systems can be decomposed into factors labelled 
by a classical variable. In this sense, they represent hybrid quantum- classical infor- 
mation 

Let us summarize this in a definition: 

Definition 2.1. A physical model characterized by a von Neumann algebra A 
will be called classical if A is commutative, quantum if ^ is a factor and hybrid 
otherwise. 

Since all commutative von Neumann algebras are of the form we will 

always explicitly use this representation for classical systems. Also note that we 
will in general only be working with factors of type I which are of the form B{T-C). 
In addition, we will often assume that Ti. is separable. 

A special case to point out is the von Neumann algebra C which is both com- 
mutative and a factor. It has only one state, and therefore represents a system with 
no degree of freedom. It is such that C^A = A^C = A for any von Neumann 
algebra A. Indeed, z ^ A = 1 ® zA for any 2; G C, which means that A can be 
identified with A through the map z ^ A ^-^ zA and its inverse A 1 ^ A. The 
same argument works for A^ C 

Let us list a few facts about von Neumann algebras that we will use 

• The commutant of any "self-adjoint" set of operators (i.e. closed under the 
dual operation) is a von Neumann algebra [40J. 

• A von Neumann algebra is the norm-closure of the span of its projectors |10] . 
This also implies that it is the weak-* closure of the span of its projectors. 

• A commutative von Neumann algebra is always of the form L°°{Q) for some 
measure space fl [Hj. 

• A factors of type I is isomorphic to B{T-l) for some Hilbert space Ti |41] . 

• If H is separable, then Bt(H) is separable |12] . 

2.3.2 Channels 

A communication channel from a system represented by the algebra Aa to a system 
represented by Ab can be defined, as in the finite-dimensional case, by a unital 
completely positive linear map 

£* -.Ab^ Aa 
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on effects. However, since we asked for the states to be weak-* continuous functions, 
we also need the channels to be weak-*-weak-* continuous, which we will summarize 
by saying that they are normal. This implies that S* is always the dual of a linear 
map 

which is meant to act on states. As in the finite-dimensional case, it is the map on 
state S that we will call a channel, while S* will only be called its dual. 

If Aa = B{T-Ca) and Ab = B{1-Lb) for some separable Hilbert spaces TIa and 
Tis, then these maps can be represented as 

oo 
k=l 

and 

oo 
k=l 

where the sum can now be infinite [33]. We will call this the Choi-Kraus form of 
the channel S. The elements are bounded linear operators from Ha to Ti^. This 
can be understood starting from the Stinespring dilation theorem for completely 
positive maps between C*-algebras, which states that there is a representation vr of 
Ab on some Hilbert space H, and an isometry V : Hb ^ Ti., such that 

S*{A) = V*7c{A)V 

for all A G Ab- Since Ab = 13(T-Cb), and £* is normal, the representation vr on is 
of the form 7r{A) = A^l. Also, if T^b is separable, then so is Ti |13]. Therefore the 
subsystem on which n{A) acts trivially is also separable, and possesses a discrete 
basis This implies that 

S*{A) = V*(A®1)V = ^V*(A® 

i 

= 5^(F*® K))A(V^® (z|). 

i 

Therefore the elements of the channel S can be chosen to he Ei = V ^ Note that 
this makes it clear that there is a large ambiguity in the choice of the elements Ei. 
Indeed, any orthonormal basis \i) would potentially yield a different set of elements. 

This definition applies for arbitrary von Neumann algebras Aa and Ab- In 
particular, if both are commutative, this simply defines a stochastic map between 
classical probability distributions. 

2.3.3 Observables 



Given that a classical system is also modeled by a von Neumann algebra, we will 
simply define an observable to be a channel (in the above sense) from a classical 
system to a quantum system, i.e. 
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Figure 2.1: The classical position observable is defined by the positive map Q* which 
sends a function on M, which represents a proposition about the position, to a function 
on M? representing a proposition about the full phase-space. The interval on M here 
represents the characteristic function for this interval, which is a sharp effect. It is 
mapped to the characteristic function for an infinite stripe in phase-space. 



Definition 2.2. An observable X on the system specified by the algebra A and 
which takes values in the measure space Q, is specified by a unital positive normal 
map X* : A, or equivalently by a trace-preserving positive linear map 



of which X* is the dual . 



In this case the complete positivity comes for free as in the finite-dimensional 
case |13]. Note that in general, A* is not equipped with a trace like the one defined 
for Bt{H). Hence the trace which is preserved in the general case is Tr(p) := p(l). 

This definition also holds when A is commutative, where it defines a classical 
observable. Note that a classical observable is identical to a classical channel, i.e. 
a stochastic map. For instance, consider the case where A = L°^(]R^), which could 
describe a single classical particle with phase-space parameterized by position- 
momentum pairs {q,p) G M^. The position observable is then represented by the 
map 

Q* : L°°(M) L= 



which sends a function a on M to the function (3 = Q{a) defined by P{q,p) := a{q)- 
Hence it says how a question about position can be translated into a question about 



phase-space (see Figure 2.1). 



This notion of an observable corresponds only to a subset of the positive operator 
valued measures. In order to obtain a POVM E from the observable X, it suffices 
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to apply X* to the characteristic functions x{^) of each subsets u C Q. Hence the 
POVM E corresponding to X is given by 

for any u O Q. This is indeed a POVM since 

for any disjoint sequence of subsets Uk C fi, and also 

E{n) = X*{1) = 1. 

In order to see that this does not yield every POVMs, consider the case where A 
is one-dimensional, which is to say ^ = C. Then a POVM E on Q with values 
in ^ = C is any positive measure on Q. Now suppose that X is an observable on 
^ = C as defined above. Then it is a linear map from C to which is just 

an element a G L^{Q) such that X*{xuj) = a{x) dx. Hence X can only represent 
measures which are absolutely continuous with respect to the measure associated 
with f2, contrary to a POVM which could represent any positive measure. This 
means in particular that, if we were to take f2 = M then we could not represent 
any discrete POVM. However this can be overcome simply by choosing instead the 
disjoint union = M U N. 

Note that we could have modeled all POVMs if we did not require that the 
channel representing the observable be weak-* continuous. Indeed, a POVM can 
be seen as a unital positive linear map 

E ■ CaiVl) A 

where Cq{VL) is the C*-algebra of continuous functions on with compact support 

The fact that our notion of observables is more restrictive than the usual def- 
inition in terms of POVM is desired, because it makes the set of all observables 
easier to handle. In particular, if A is finite-dimensional, then the set of all the 
observables with value in a fixed set VL is separable. 

2.3.4 Sharp observables 

As in the finite-dimensional case, an observable X will be called sharp if it maps 
all the sharp classical propositions x^, C i7 to sharp quantum propositions 
{X*{xu))Y = X*{Xuj)- This implies that the corresponding POVM is in fact a 
projection valued measure (PVM). A PVM is also known as an orthogonal parti- 
tion of the unity. It can always be seen as representing the spectral measure of a 
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self-adjoint operator A. If Q — R, and we define the PVM Il{uj) :— X*{xu}), we 
have 




X Il{dx) 



This notation can be understood intuitively as meaning that we define the integral 
as the limit of discrete sums with intervals Ax. Il{dx) can then be conceived 
intuitively as the hmit of the operators 11 (Ax). 

If the sharp observable takes value in a set Q 7^ R, representing it as the 
spectral measure of a self-adjoint operator A would require a measure-preserving 
map / : O ^ R. We could then write 

A:= [ f{x)U{dx). 

This is due to the fact that an observable represented by a self-adjoint operator 
takes values in its spectrum, which must belong to R. It suffices to think about the 

case n = R^ to realize that the map / can be pretty messy. This shows that the 
formalism of PVMs can be more convenient than that of self-adjoint operators in 
that it easily allows to talk about observables with non-scalar values. 

Conversely, if we are given the self-adjoint operator A, then the corresponding 
PVM is given by the L°°(r2) functional calculus of A, where is the spectrum of 
A. This means that for any classical proposition a e L'^{Vl), we obtain the effect 
X*{a) through the relation 

X*{a) = a{A). 

The map X* is a von Neumann algebra isomorphism. In particular it satisfies 

X*{a(5) = {a(5){A) = a{A)(5{A) = X*{a)X*{(3). 

The image of X* is a commutative subalgebra of the von Neumann algebra on 
which it is defined, namely the von-Neumann algebra generated by ^4, which is a 
faithful representation of the algebra L°°(Q). 



2.3.5 A category 

One advantage of the formalism chosen is that classical systems and quantum sys- 
tems are both modeled in the same way, as von Neumann algebras, and all the maps 
of interest are normal completely-positive unital maps between these algebras. In 
the dual picture, we are working with the pre-duals of von Neumann algebras and 
the trace-preserving completely positive maps between them. We will generally 
make use of the maps between pre-duals since they directly say how states evolve. 
Intuitively, the right direction of the flow of information is from states to states. 
Indeed, states represent the information that we, the observers, possess about the 
systems, and it is our information which flows from one to the other. 
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Hence most of our discourse will take place in the category whose objects are 
the pre-duals of von Neumann algebras, and whose morphisms are trace-preserving 
completely positive maps between them [44j. For instance, an observable X of a 
quantum system represented by the Hilbert space 7i is a morphism 

where L^{Q) contains the states of the classical pointer which represents the value 
of the observable. Of course, quantum channels are also morphisms: 

Bt{HA) — BtiHs) 
States themselves can be represented by the morphisms 

c — ^ Bt{n) 

Indeed, all that this map p can do is send each complex number z to z times a fixed 
element, say p, of Bt(H): z i-^ zp. Since p is positive, p must be a positive operator. 
In addition, the fact that p is trace-preserving implies Tt{zp) = zTr(p) = Tt{z) = z, 
i.e. Tr(p) = 1. 

As an exercise, note that the trace is the only morphism from BtilH) to C. It 
amounts to erasing all information. 

Tr 

Bt{H) ► C 

Effects can be represented as observables with pointer Vt = {0, 1}, i.e. with target 
Li({0,l}) = L({0,l})^C2: 

At this point we may wonder about the possible meaning of a morphism (j) : 
L^{Q) —y Bt{Ti.), i.e. a flow of information from a classical system to a quantum 
system. Since it is linear it must be defined by its action on a basis of L^{Q). 
Consider for instance the case where fl is finite. Then a basis is given by the pure 
states XiU) = ^ij for S'll hj ^ ^- Since these functions are states, the images are 
quantum states pi := which are completely arbitrary. Hence is a state- 

preparation device. It encodes classical information into a quantum system by 
directly mapping the pure classical states into arbitrary quantum states: 

L\Q) Bt{H) 

The composition of a state preparation and an observation yields a channel 
between two classical systems: 



L\nA) Bt{n) — - L\nB) 



Summary 
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Hybrid 


Classical 


Quantum 


Algebra is ... 


arbitrary 


commutative 


factor of type I 


dimension 


arbitrary 


finite 


arbitrary 


finite 


arbitrary 


Algebra 
Pre-dual 


A,B,... 
A B 


L{Q) 

Lin) 




M{n) 
M{n) 


B{n) 
Bt{n) 


Effects 
States 


E(^),E(i3),... 
S{A)MB),... 


1 


E(^]) 


E{n) 



Table 2.1: Summary of the symbols used for sets of propositions and states. 



which is simply a stochastic process mapping a probability distribution on VLa into 
a probability distribution on ^2^. For instance we get the classical capacity £ by 
maximizing the Shannon capacity of the classical channel 

L\nA) — BtiHA) ► BtiTis) L\Qb) 



over all states preparations and all observables X. 



2.4 Summary 

In this chapter, we have seen that we can model quantum and classical systems in 
terms of their set of effects, or equivalently in terms of the dual set of states. Effects 
represent elementary propositions and states assign a probability to each effects. 
In both the quantum and classical cases, the effects can be seen as elements of a 
C*-algebra. The essential aspect of these algebras, for our purpose, is the fact that 
they are ordered vector spaces, i.e. vector spaces with a suitably compatible partial 
order. The effects are those vectors which are larger than zero and smaller than 1. 
For our purposes, it is convenient to assume that the C*-algebras have a pre-dual, 
which means that they are von-Neumann algebras. We then restrict states to those 



functionals which live in the pre-dual. Table 2.1 summarizes some of our notations. 

A transfer of information, or channel, from system A to system B is represented 
by a map from the states of A to the states of B, whose dual is a unital completely- 
positive map from effects of B to effects of A. For clarity, it is always the map on 
states which will be referred to as the channel, since it goes in the direction which 
we intuitively associate with the direction of the flow of information. 

A quantum observable, or observable for short, is a channel from a quantum 
to a classical system. The classical system can be interpreted as that part of a 
measurement apparatus which displays the result of the measurement. This notion 
of an observable is essentially equivalent to that of a POVM, although slightly more 
restrictive. 



Chapter 3 

Preserved information 



Recall that our aim is to characterize decoherence through the type of information 
flowing between a system and its environment, or within the system. Therefore, we 
need a way to formally talk in detail about the nature of the information preserved 
by a quantum channel. In this chapter, we introduce the main new concept that we 
will be using, and derive a number of basic results about it. Although our results 
are based on many known results, mostly in the theory of quantum error correction, 
we will present them in a logical, rather than a historical order. Hence we will not 
mention the known results here, but instead show how to derive them from our 
more general point of view in the next chapter. 

The main definition that we will be playing with is the following: 

Definition 3.1. An observable X is said to be preserved by a channel S if there 
exists an observable Y such that X = Y o S. We write P^- for the set of observables 
preserved by £. 



Recall that the equation X = Y o S means that X is the image of Y in the 



Heisenberg picture (see Section 2.1.5). For a discrete observable with elements Xj 



and Yi, this means that Xi 
have: 



S*{Yi) for all i. If f2 is the set of values of X, we 



A, 



S 




Y 



L\n) 

Hence, for any state of system A, measuring Y on system B yields the same result 
(i.e. the same probability distribution on Q) as measuring X on the initial system 
A. This justifies the idea that the information about A represented by X has been 
preserved by the channel S: we can perfectly simulate an observation of X on the 
initial state by observing Y after the action of the channel. 
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If the observable X is sharp, we know that it can also be represented by a self- 
adjoint operator X. As an example, suppose that this observable has a discrete 
spectrum. This implies that it is of the form X = XiPi where Xi is the eigenvalue 
corresponding to the spectral projection Pj. In this case, X is preserved if Pi = 
S*{Yi) for all i, where the effects Yi form a discrete observable, i.e. Yi = 1. 

It is not sufficient to require that X = £*{Y) for some self-adjoint operator Y. 
Indeed, this equation only means that the first moment of X is preserved for any 
state p: Tr{pX) — Tr{£ {p)Y). On the other hand, if we require that Pj = for 
all i, then all the moments of X are preserved. Indeed, for any n, X" — ^^x'^Pi, 
from which it is easy to see that also 

Tr(pX") = Tr(£(p)F") 

for any n. 

Note that we can also define the moments of any scalar (and possible unsharp) 
observable X defined by 

X*(a) — / a{x)Xxdx 
Jr 

where X^ are operators, and a a real function. The nth moment of X can be 
represented by the operator 

[ x^'X^dx. 
Jr 

Indeed, we have 

Tr{pX~^)= ! a;"Tr(pX^) = I x^X{p){x)dx 
Jr Jr 

This equation means that the expectation value of X'^ on p yields the nth moment 
of the probability distribution X{p). If X is preserved, then so are its moments, in 
the sense that X" = S*{Y^) for all n. 

Consider the case of yes-no observables, i.e. observables with two outcomes, 
which are one-to-one with effects. If Xq and Xi are the two elements characterizing 
a yes- no observable, we must have Xq + Xi = 1, which implies Xi = 1 — Xq. This 
observable is preserved by the channel S when there exists another yes- no observable 
Y such that Xq — £*{Yo) and Xi — E*{Yi). However the second equation is redun- 
dant. Indeed, this first equation implies S*{Yi) — S*{1 — Yq) = S*{1) — S*{Yo) = 
1 — Xq = Xi. Therefore, the preserved yes-no observables are characterized by what 
we define as the preserved effects, namely the effects which are image of another 
effect under S*. 

Definition 3.2. An effect A G E(^i) is preserved by £^ : — * ^2 ff there exists 
B G E(^2) such that A = £*{B). Hence, the set of preserved effects is £*(E(^2))- 
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A general preserved observable X satisfies X* = S* oY*, which means that for 
any classical proposition a, X*{a) = S* [Y* [a)) . Hence all the effects X*{a) associ- 
ated with the observable X are individually preserved by the map S. For a discrete 
observable X, this means that any sum of its elements belongs to £^*(E(^2))- 



3.1 A set of observables? 

We introduced a notion of preserved observable, and therefore also a notion of 
preserved effect. Since preserved observables are made of preserved effects, we may 
wonder if we could limit the discussion to the set of preserved effects, which are 
simpler objects. In addition, the set of preserved effects is an effect algebra, which 
can be understood as defining an effective physical system of its own, as discussed 
in Section 

However, the set of preserved effects, although it certainly gives interesting 
information about the channel, does not always contain as much information as 
the set of preserved observables. Indeed, it is not always true that a preserved 
observable is characterized by preserved effects. A counter example is given in 
Section 13.1.11 below. 

However it is true in the "generic case" , when the channel has no kernel: 

Proposition 3.1. // the channel £ : Ai A2 is invertible, then the two following 
statements are equivalent 

1. The observable X is preserved by £, 

2. X*{a) e ^*(E(^2)) for all a. 

Proof. We have already shown above that Statement 1 implies Statement 2. Con- 
versely, suppose that X*{a) G £*(E(A)) for all a. Let := {8*)'^ o X*. We have 
to show that the linear map is positive and unital. Since £* is invertible, its range 
is the whole positive cone of A2 and its inverse maps the positive cone of A2 to 
that of Ai, which means that it is also positive. Given that X* itself is positive, 
is positive. In addition we have that = 1, which implies 0(1) = 1. □ 

3.1.1 A counter-example 

Let us now give an example of a channel, together with an observable which is not 
preserved but which is made of preserved effects. We consider a channel of the form 
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Let i — 1, . . . ,4: denotes a basis of C^. For convenience, we will use its first three 
elements \i) for i = 1, . . . , 3 to define a basis of too. Also, we write Pi :— 
We define £ by 

1 ^ 2 
^(P) :=3El^)(W)(^l + 3TV(p)P4 

i=l 

It is easy to see that this map is completely positive. Indeed the first term already 
has the Choi-Kraus form, and the second term can be written as | Yl^^i |4)(i|p|i)(4|. 
Also it is trace-preserving. Indeed, 

Tr{S{p)) = lj2(t\p\t) + lTr{p) 

1=1 

= ^Tr(p) + ^Tr(p) = Tr(p). 

The dual is 

i=l 

Now consider the discrete observable X on with elements 

X,^^{P2 + Ps) + ^l 
X2 = ^(Pl + P3) 
Xs=l{Pl + P2) 

These elements are clearly positive and sum to the identity: 

i i 

In addition, any sums of the elements Xi are inside £*(E(C^)). To see this it is 
useful to note that S*, viewed as a linear map, has a kernel which is generated by 

K^P^ + P2 + P3-^P,. 

Since S*{K) = 0, we can subtract any multiple of K in the argument of £* in order 
to make sure that the argument is an effect. If we define the effects 

y2°:=Pi + P3 
Yi:^P, + P2 
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then we have Xi — S*{Y^) and also 



K) 



X2 + ^3 — 



S*{Y2' + Y,' 







Xi+Xs 



£*{Yl> + Y^ 







This suffices to show that X*(a) e £^*(E(C^)) for any a. However, the observable 
X as a whole is not preserved by the channel S. In order for it to be preserved, 
we would need to find another observable Y with elements Yi, i = 1, ... ,3 such 
that Xi = S*{Yi) for all i. We note that the effects Y^'^ cannot serve this purpose 
because they do not form an observable. Indeed they fail to sum to the identity: 
y.° = 2Pi + 2P2 + 2P3 + = 2I4 - IP4. If we wish to modify these three 
operators while maintaining their images under S*, the only thing we can do is add 
or subtract different multiples of K to each of them. However it is easy to see that 
this cannot be done without getting them out of the set of effects E(C^). Indeed, 
note that they are all diagonal operators, hence the diagonal coefficient are the 
eigenvalues. It is easy to see that adding or subtracting any amount of K to any 
of them would set some diagonal elements either smaller that or larger than 1, 
which would mean that they are not effects anymore. 

Note that this example is entirely classical since all the operators involved are 
codiagonal. Therefore the counter example also applies to classical channels. 



3.1.2 Preserved effective theory 

As mentioned above, the set of effects preserved by a channel is an effect algebra. It 
can be understood as defining an effective physical system, namely a "subsystem" — 
in a very general sense — of the initial quantum system. When the channel is in- 
vertible, this subsystem has a true operational existence as it effectively models 
the experimentally accessible degrees of freedom of the system filtered through the 
noise represented by the channel. 

However, the above counter-example shows that for more general channels, some 
of the observables represented by this set of effects cannot in fact be measured (are 
not themselves preserved). This implies that, in principle, we should use a more 
general framework than that of an effect algebra, and define a physical theory 
directly by a set of general observables. 

This explains why, in the rest of this thesis, we will always take the set of 
preserved observable as the fundamental object of study, and reduce it to effects 
only when possible. 
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3.2 Sharp preserved observables 

In this section we will consider a quantum channel S : BtiTii) — > Bt(H2)- We will 
also assume that Tii is separable. We saw in Section [2.3.2| that such a channel can 
always be represented in terms of a countable family of channel elements E^, as 

k 

where the sum converges in the weak-* topology. Note that from now on, all sums 
of operators will be assumed to be at most countably infinite, and convergent in 
the weak-* topology. 

Let us consider the sharp preserved effects, i.e. those which belong to £* {E{H2)) ■ 
A sharp effect P G E(7ii) is a an effect which is also a projection, i.e. P^ = P. If 
it is preserved, then there is an effect B G ]E(?i2) such that 

P = S*{B). (3.1) 

In order to characterize the sharp preserved effects, we want to obtain an equivalent 
condition which involves only the channel elements E^. 

First, let us state a simple technical lemma that we will use several times: 

Lemma 3.2. // F^BF^. = for some positive operator B and a family of 
operators F^, then BF^ = for all k. 



Proof. For any vector we have 

J2{i^\F*BF,\i;) = Y,{ij\Ft^* ^Fu\i^) = ^ ||v^F, 



0. 



This implies that each positive term || v-B-Ffc|'?/')|| the sum is zero. Therefore 
^Fk\'ip) = for any vector I?/'), which means that VBFf^ = 0. □ 



We could apply this lemma to Equation 3A if we had a zero on the left-hand 
side of the equation. This is the case if we multiply this term by P^ = 1 — P. 
Indeed P^P = (1 - P)P = P-P'^ = P- P = 0. But we want the right-hand side 
term to be positive for the trick to work, therefore we multiply by P"*" from both 
sides: 

P^£*{B)P^ = P^PP^ = 



Applying Lemma 3.2 we get BEk{l — P) = for all k, i.e. 



BEu = BEuP. 



(3.2) 



There is another similar equation that we can use. For every preserved effect P we 
get another preserved effect for free, namely its orthocomplement: 1 — P. Indeed, 
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S*{1 — B) = 1 — £*{B) = 1 — P, and 1 — i? is a valid effect. Using the same 
argument as above, we obtain (1 — B)EkP = 0, i.e. 



EkP = BEkP. 



Combining Equations |3.2| and |3.3[ we get 

BEk = EkP. 



(3.3) 



(3.4) 



This means than the existence of an effect B which is such that BEk = EkP for all 
channel elements Ek is a necessary condition for P to be preserved by the channel. 
In fact, it is easy to see that this condition is sufficient. Indeed, if BE^ = E^P, 
then 

£*iB) = Y^ElBEk = Y^ElEkP = S*{1)P = P 



We achieved obtaining a condition which depends on each channel elements 
Ek independently, rather than on the whole channel. However, if we really want 
a useful characterization of the sharp preserved effects, we need to eliminate any 
reference to the unknown operator B. First note that by taking the adjoint of 
Equation 3.4 we obtain E^B = PE^. Together with Equation 3.4, this implies 

E'^EjP = E'^BEj = PE^Ej. 

This is a necessary condition for P to be preserved, which does not involve the 
unknown effect B. In fact, this condition is sufficient. 

Theorem 3.3. A sharp effect P is preserved by the channel S{p) = 'Ylk^kpEt ^/ 
and only if 

[P,E:E,] = OforalH,j. (3.5) 



Proof. We have already proven the necessity. In order to prove the sufficiency, we 
will use this condition to build the effect B of which P is the image. We will need 
the completely positive map Sx defined by 

oo 

Sx{A) ■.= Y,^iE^AEl (3.6) 

where A, := 2^*. This choice of A guarantees that the sum converges in norm for any 
effect A since < 1. In fact, any choice of components Aj > which makes 

this sum weak-* convergent for any effect A would be sufficient for our purpose. 
Note that if Aj = 1 for all z, then E\ = which is defined only on trace-class 
operators. 

We will try the ansatz 

B := i£,il))-'£,iP). (3.7) 

First, we have to show that Sx{l) can indeed be inverted. Note that Sx{l)\ip) = 
if and only if {ip\£x{'^)\i^) = since it is positive. If we write the channel in terms 
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of the elements Ek, we obtain a sum of positive terms which must all be equal to 
zero: {^p\EiE*\^|J) = 0. Hence E*\iIj) = for all i. This means that if lip) is in the 
kernel of £^a(1)5 it must be in the kernel of each E*, and therefore be orthogonal to 
the range of each Ei. This shows that we can invert Sx{l) on the range of any of 
the operators E^. In the following we will always be able to assume that (£^a(1))~^ 



operates on the span of the ranges of the operators E^. In particular, Equation 3.7 
is well-defined. 

Now that we have defined B, let us check that S*{B) = P. We have 
BE, = {Sx{l))-'SxiP)E, 

k 
k 

= {£x{l))-^£x{l)E,P = E,P 



which is Equation 3.4 and proves that £*{B) = P. However we have to check that 
B is an effect. Note that BE, = EiP implies ^^XiBEiE* = ^-XiEiPE*, i.e., 
B£x{l) = £x{P), which implies B = £x{P){£x{l))~^. From the definition of 5, we 
also have B = {£xil))-^£x{P)- Hence [{£x{l))-\ £x{P)] = 0, which implies that 

B = £x{l)-'^£x{P)£x{l)-'^>0 (3.8) 

In addition, B < £x{l)^^l£xi'i-)^^ = 1, which shows that it is an effect. □ 

The dependence of i? on P in the above proof is in fact given by the dual of a 
completely positive trace-preserving map, i.e. a channel. Explicitly, this channel is 

n{p) = £:{£x{irh£xii)-h (3.9) 

and its dual 



n*{A) = £x{l)"£x{A)£x{l)--^. 

We will see in Chapter |4] that this channel TZ is the "correction" channel of the 
theory of quantum error correction. This particular way of writing it appeared 
before in [15] for the case Aj = 1 which works when Tii is finite-dimensional. 



Theorem 3^ yields also some unsharp preserved effects. Indeed, given the lin- 
earity of £*, if two effects are preserved, then so is any of their convex combinations 
(which are also effects). Therefore, the convex hull of the preserved projectors is 
entirely preserved. In fact, the continuity of this channel implies that the weak-* 
closure of this convex hull is also preserved. 

Consider the commutant As of the operators E*Ej, i.e. the set of operators 
which commute with them. This set is clearly an algebra. In fact, it is a von 
Neumann algebra, which always has the property that the set of effects it contains 
is the closed convex full of its projectors [46]. Since all the projectors in As are 
preserved, then so are all the effects it contains. This proves the following: 
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Corollary 3.4. The set of effects spanning the von Neumann algebra 

A£ = {Ae BiUi) I [A, E*Ej] = for all 

are all preserved by S. In addition, this algebra contains all the preserved sharp 
effects. 



We can now easily generalize Theorem |3.3| to all sharp observables. 
Theorem 3.5. A sharp observable X is preserved by the channel £ if and only if 

X*{a)eA£ for all a (3.10) 

Proof. The necessity of this condition follows from the fact that if X is preserved, 
then there is an observable Y such that X*{a) = S*{Y*{a)) for all a, which implies 
in particular that all the sharp effects X*{Xiu) ioi u C Q are preserved by S, and 
therefore belong to As. Since the unsharp effects X*{a) all belong to the von 
Neumann algebra generated by the sharp effects X*{xuj), we also have X*{a) G 
As for all a. Conversely, consider the channel Y := X o TZ^ which is clearly an 
observable. If X*{xuj) £ Ae for all u, then we know that = S*(Y*{xui)) for 

all u. By linearity and continuity of S*, X*{a) = £*(Y*{a)) for all a. □ 

This theorem yields a direct interpretation of the whole algebra As if we rep- 
resent the sharp observables by self-adjoint operators in the traditional manner. 
Indeed, remember that we can represent a sharp observable X by the (possibly 
unbounded) self-adjoint operator X whose spectral projection-valued measure is 
u X*{xoj)- They are related through the equation 

X*{a) = a{X) 

which involves the functional calculus on X. 

Corollary 3.6. A sharp observable represented by the bounded self-adjoint operator 
X is preserved by the channel £ if and only if 

XeAs. 

Proof. If X is preserved, then As 3 X*(id) = id(X) = X. Conversely, if X e As, 
then X*(a) = a(X) e for all a. □ 

We have not yet discussed the properties of the operators B obtained by Equa- 
tion 3.8 In fact they are projectors. Indeed, remember that B = Sx{l)^^£x{P) = 
Sx{P)Sx{l)-^ . Therefore 

B'=Sxil)-'Sx{P)SxiP)Sx{ir' 
= £xil)-'£xil)£x{P')£x{l)-' 

= £x{p')£x{ir' 

= £x{P)£x{l)-^ = B. 
This can be used to show that 
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Proposition 3.7. A preserved sharp observable is always the image of a sharp 
observable. 

Proof. Remember that if the sharp observable X is preserved, then it is the image 
of the observable Y given by Y*la) = 7^*(X*(a)) = £x{l)-'^SxiX*{a))£xil)-'^ ■ 
From the above observation, we know that since X*{xuj) is a projector for any 
subset cij C f2, then so is Y*{xuj), which proves that Y is sharp. □ 

3.3 Outgoing information 

If the evolution in time of a quantum system is given by a channel 

£ : BtiHi) ^ Btin2) 

which is not unitary, then it means that the system is interacting with an "envi- 
ronment" . The environment can be any other quantum system, the state of which 
we have ignored in our description. This fact is reflected in the Stinespring dilation 
theorem |13], which states that a completely positive map as above can always be 
represented (via its dual) as 

8*{A) = V*{A®1)V (3.11) 
where is a bounded operator 

such that V*V = 1, i.e. an isometry. Here He can be interpreted as the Hilbert 
space of the environment. The Choi-Kraus representation of the channel follows 
from this theorem by noting that, for any orthonormal basis \i) of the environment, 

£*(A) = J2V*{A^ |z)(z|)F = ^ K*(l ® \i))A{l ® {i\)V 

i i 

Hence the channel elements of S are 

E, = {l®{i\)V (3.12) 

which means that 

S{p) = Y^E^pE* = J](l ® {t\)VAV*{l ® 

i i 

= Tte{VAV*) 
where we have introduced the channel 

Tie ■■ Btin2(S)nE) ^ Btin2) 
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Figure 3.1: Dilation of the channel £, and of the complementary channel £c, represented 
as circuit diagrams. 1-0) is the initial state of the environement, and U the unitary operator 
representing the joint evolution of the system and its environement. The horizontal bar 
ending a vertical line represents a partial trace. 

defined by 

Tr^(p) = J](l®(z|)p(l®K)). 

i 

This channel is called a partial trace, here taken over the environment He- It 
corresponds to the action of "erasing" any information about the system T-Ce- Its 
dual, which we implicitly used above, is the ampliation map 

TT*E:B(n2) —^B{n2®nE) 

A I — > A®1 

It says how an effect of 7i2 can be also seen as an effect of 7-^2 ®'He- 

Physically, suppose that the environment starts in a known state \'4'e)i and 
that the two evolve according to the joint unitary operator [/, i.e. p ® \i'E){i^E\ 
evolves to U{p ® \iPe){'4^e\)U* ■ Since the initial state of the environment is fixed, 
and uncorrelated with p, we can view this simply as a map on p G -B((7ii), with 
target in i3t(7^2 ®'He)'- 

P^U{p® \i;E){i^E\)U* = U{1 ® |^i^))p(l ® {iJE\)u* 



(See figure 3.1 ) This gives a physical interpretation to the isometry 

V ■=U{1(S)\iIje))- 

To check that this is indeed an isometry, note that V*V = (1 ® {iI)e\)U*U{1 ® 
IVe)) = 1{i1^e\i1^e) = 1. 

The operation which consists in taking the partial trace over the environment 
means that we ignore its state for the calculation of any further expectation value, 
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or probability. Indeed, if, after the interaction, we only plan to measure observables 
of the system, which are of the form A (E) 1, then all expectation values take the 
form 

Tr{VpV*{A O 1)) = Ti{TiEiVpV*)A). 

This interpretation is also straightforward if we stick to the Heisenberg picture, 
where the form S*{A) = V*{A ^ 1)V directly implies that we are only worrying 
about observables of the form A ® 1 on the final joint state of the system and 
environment. 

The characterization of the preserved sharp observables, derived in the previous 
section, can also naturally be expressed in terms of the isometry V rather than in 
terms of the channel elements E^. Remember that the preserved observables are 



those which commute with the operators E*Ej for all i and j. Using Equation 3.12 
we have 

E:E, = V*il^\t){j\)V. 

Hence the span of the operators E*Ej is simply V*{1 ^B(Ti.E))V. This means that 
a sharp effect P is preserved by S if and only if 

[P, V*{1 ® B)V] = for all B G ^(T^e). (3.13) 



In fact, the effects of the form V*{1®B)V have an important physical meaning. 
Indeed, they are preserved by the channel £c defined by its dual 

S*{B) := V*{1®B)V. 

This channel Sc is called the complementary channel to S. As S maps Tii to 7^2, 
Sc maps Til to 7i^. It tells what information flows from the initial states of the 
system Hi to the environment He- Indeed, the effects of the form S*{B) G B{Hi) 
can be simulated by measuring B on the final state of the environment. 



This gives us directly an interpretation for Equation 3.13 



Corollary 3.8. A sharp observable is preserved by S if and only if it commutes 
with all the effects preserved by the complementary channel Sc- Hence, the preserved 
algebra As is given by 

Ae = {A\ [A, B] = for all B G S*{E{He)) }- 



In particular, this implies that the sharp observables preserved by 8 all commute 
with the sharp observables preserved by Sc- Hence the algebra 

Cs := Ae n As^ 

is commutative and therefore characterizes an effective classical system with phase- 
space fl such that 
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This is a detailed form of the no-broadcasting theorem, which is a stronger 
version of the no-cloning theorem. Indeed, the algebra Cs characterizes the sharp 
observables representing information which has been preserved in the system and 
at the same time transfered to the environment. Therefore this information is 
accessible from two different systems. This means that it has been at best broadcast 
(if there are correlations) and at worst cloned (if there are no correlations). Hence, 
the fact that is commutative means that the sharp information which is broadcast 
must always be classical. 



We will argue in Section |5.2| that this gives a way in which a classical system 
can emerge within a completely quantum system. Here the classical system is 
characterized by the only sharp information which is broadcast by an arbitrary 
quantum interaction. A generalization of this result to unsharp observables will be 



the basis of our picture of decoherence, presented in Section 5.5 



3.4 Example: measurements 

Examples involving sharp observables will be given in the context of quantum error 



correction in Chapter 4.1 Since most results presented in this chapter concern 
only sharp observables, let us give here some examples involving unsharp preserved 
observables. 

Consider a channel on B{T-C) of the form[^ 

^(p) = y^,h\i'k){i'k\p\i'k){i'k\ 

k 

where the states {ipk) are normalized, but need not be orthogonal to each other. 
This channel maps an observable Y with values in Q to the observable X = Y o S. 
Explicitly, 

X*{a) =S*{Y*{a)) 

= J2>^i\i^^){^^\Y*i^M^){A\ 
i 

i 

for all classical effects a G L'^{VL). By construction, X* is preserved by £. In fact, 
it is also preserved by the complementary channel £c- Using a basis of 7i, we 

can write the complementary channel as 

ijk 



This is an example of an entanglement-breaking channel 
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If we define an observable Z of the environment by 

Z*{a):=Y,{^,\Y*{a)\m){^\ 

i 

then we have 



a) 



This proves that all the observables preserved by S are also preserved by Sc- 
In this case, this can also be seen by the fact that we can writ^ 

S = J-" o 

for some channel JF, which is given by the expression that we used to define Z from 
Y: 

j'*{A) = j2mm)\^){^\- 

i 

Therefore, for a channel of this form, all the information which is preserved in the 
system also flows to the environment. This means that the whole of the information 
preserved by £ is broadcast, in the sense that this information is accessible from 



both the system and the environment after the interaction. We saw in Section 3.3 



that the sharp observables broadcast in this way must form a commutative algebra. 

Clearly, this is not true of these broadcast unsharp effects. Indeed, it is easy 
to see that the operators Afc|'0fc)(V'fc|) which need not commute with each other. 



are preserved by £ and therefore also by £c. We will see in Section p.3| that this 
information is "classical" , but in a more general way. This is not surprising as the 
channel £ can be relayed by a classical system. Indeed, it could be simulated by 
first measuring the observable F, with elements 

and then preparing the state \ipk) corresponding to the outcome k of the mea- 



surement. Note however that, as we will also see in Section p.5.2[ the information 
broadcast between only two systems need not even be classical in that more general 
sense. 

Another way of looking at this channel £^ is as a pre-measurement of the discrete 
observables F defined above. This means that it describes the effect on the state 
p of a certain measurement of F, if the actual outcome of the measurement is 



traced-over or "forgotten". In the notation of Section 2.1.6, the measurement is 



defined by the partial channels J-fc(p) = Afc|V'A:)('0fc|p|V'fe)(^fc|- Note that F itself is at 



^In the literature, this property is summarized by saying that £c degradahle, while E is said to 
be anti- degradahle 
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best approximately preserved by S, in the sense that coarse-grainings (approximate 
versions of) F are preserved, as for instance the observable with elements 



r can be considered an approximation of T because the distribution \{il^i\ipk)\ is 
"peaked" at i = k. In the limit where the vectors \ipi) are orthogonal, |(V'i|'^fc)| — *■ 
Sik- However, note that there exists non-trivial observables for which \{ipi\ipk)\ is 



independent of k and i (which are the SIC-POVMs described in Section 5.5.3) 



The fact that the observable being measured is not preserved in the system 
illustrates the well-known fact that two successive measurements of a POVM do 
not yield the same outcome for sure. However, F is exactly preserved by the com- 
plementary channel Sc- Indeed, we have 

= h\i^k){i^k\ = Tfc 

This is what is important for £^ to be a measurement of F: the apparatus (environ- 
ment) must gain full information about F. 

It is always true that, if a channel represents the effect of a measurement, then 
its complement exactly preserves the observable being measured. Indeed, recall 
that in Section 2.1.6| we introduced a measurement by postulating that the chan- 



nel describing the evolution of the system be complementary to the quantum-to- 
classical channel defining the observable being measured, namely F. The quantum- 
to-classical channel F can always be seen as a fully quantum channel by embedding 
its target algebra into the full quantum algebra B{TCe)- In addition F is always 
preserved by F itself, since it is the image of the "identity" classical observable. Let 
us illustrate this when F is discrete, with elements Fj. If we represent F as a full 
quantum channel, its action is 

i 

for some orthonormal basis \i) of the Hilbert space T-Ce- The dual is 

i 

which implies that 

r{\^){^\)=T, 

Hence F is the image under F of the sharp observable with eigenstates which is 
the complete "universal" observable on the classical system Q. In fact, F represented 
as a quantum channel is the minimal channel preserving F itself, which makes sense. 
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n = 2 (PVM) 





n = 3 
V 




n 



4 



n 
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oo 



Figure 3.2: Representations of subsets of effects on a qubits in terms of tlie components 
IjCxjC^- The first image is tlie full set of effects. The lower tip is the zero operator, the 
upper tip is the identity operator. All other shapes represent the set of preserved effects 
for the channels defined in Equation 3.14 for n = 2, 3, 4, 5, and Equation 3.151 for n = oo. 



These channels also have the property of being fully decoherent (see Section 



5.4). 



We will say more about this type of preserved information in Section 5^ where we 
will show that the information preserved by an observable represents in a sense the 
most general way that classical information can be embedded in a quantum system. 

An important example of a channel of this type is given by replacing the states 
by the continuous set of coherent states. This "coherent-state channel" will be 
studied in Sections 15.1.11 and 15.41 



of the form 



Figure |3.2| represents graphically the set of preserved effects for a few channels 

(3.14) 



k=l 
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where 

^l^ + l cos(27rA;/n) a, + ^ sm{2nk/n) a,, 

(Tx and (Tz are two Pauli matrices, and the vectors \k) form an orthoginal basis in 
the channel's target space. The case labelled n — oo represents the channel 

Snip) = ^ j^^ \e){^PeWe){e\ de (3.15) 

where 16*) are pseudo-eigenstates of a multiplication operator on the circle, and 

11 1 



3.5 Example: uncertainty relations 

We will use the concept of preserved observable to show that an approximate mea- 
surement of the position of a particle preserves an approximate momentum observ- 
able, where the relationship between the precision of both is given by Heisenberg's 
uncertainty relations. 

Consider the position x of a one-dimensional non-relativistic particle, which is 
the multiplication operator on H — -L^(M). It will be convenient to follow the 
tradition and use the improper eigenstates \x) of x. An approximate measurement 
of this sharp observable can be represented by the unsharp observable X defined 

by 

X*{a) = / a{y)Xydy 

JR 

where the operators Xy are 

Xy^ J dx\G{x-y)\^\x){ 



x\ 



for a smearing function G{x), which could for instance be chosen to be a Gaussian 
centered around zero. Note that since the observable x does not in fact have 
eigenstates, the proper way of defining Xy would be using functional calculus, as 

Xy^\G{x-yl)\\ 

Normalization of the observable X implies 

dy\G{x,y)\' = l. 



The effect on the particle of a measurement of this observable could be given by 
the channel 



£{p) = j EypE; 



dy 
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where the operators Ey are defined by 




dxG{x — y)\x){x\ — G{yl — x) 



Note that since this channel represents the effect of a measurement, we aheady 
know that the information which flows to the environment is precisely the classical 
information represented by the observable X being measured. The channel itself 
however preserves a lots of information. In particular, it preserved exactly the sharp 
position observable. Indeed, we have 



Now suppose that we measure the momentum observable p = after the 

channel has acted. If \p) represents the improper eigenstates of p, we have 





dy dx dx' G{x, y)G{x' , y)5{x — z)S{x' — z)\x){x' 




z){z 




dy I dx G{x — y)e 



,ixp 





dqdq'dydzdz' G{z)e'^'+y'^^P-'i'>G{z')e-'^''+y^^P-''"^\q) {q' 





where G{p — q) is the Fourier transform of G{x — y). This shows that the channel 
preserves an approximate momentum observable P deflned by 




where 




Since G{p) is the Fourier transform of G{x), the second moment of the dis- 
tributions |G(a;)p and satisfy the Heisenberg uncertainty relation. This 
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gives an operational formulation of the uncertainty relation which works for a sin- 
gle measurement, i.e. which does not rely on expectation values as in the original 
formulation. It shows that if an observable is measured with a finite accuracy, the 
information about the canonical conjugate is preserved up to an accuracy given by 
the uncertainty relation. 

Note that the measurement which yields the channel S is naturally generated 
by an interaction of the type 

H — u X <S> P- 

between two Schrodinger particles. Indeed, defining 

Kx := vxp 



we obtain 



n 

dx \x){x\ (g) 22 — ^x'^{vpY 

n 

J dx \x){x\ (g) e"^*^^. 



Therefore, if the initial state of the second system is \ip), the channel describing the 
evolution of the first system is 

S*{A) = (1 (8) (V'Des*^!^ (8) l)e-s*^(l (8) jV')) 

= I dy(l®(V'|)et*^(A®|y)(y|)e-^*^(l®|V')) 

= J dyE;AEy 

where 

Ey= {1® (y|)e-t*^(l ® \^)) = J dx {{y - x)ut\ij)\x){x\. 

By \{y — x)vt), we mean the eigenstate of position with eigenvalue (y — x)vt. This 
has the form of the approximate position measurement where the smearing function 

G{y~x) = {{y-x)ut[(p) 

depends on the initial state of the apparatus. Note that the second moment 
of [(^(a;)^ decreases linearly with time. Hence the measurement's accuracy in- 
creases continuously with time. However this fact would largely depend on the 
self-Hamiltonian of the second system, which we neglected here. 



Chapter 4 

Correctable information 



In the previous chapter, we studied observables which are preserved by a channel 
S, in the sense that they can be precisely simulated by another observable after the 
channel has acted, no matter what the state was. Formally, X is preserved if there 
exists Y such that 

X = Yo£. 

We have seen that when X is sharp, Y can be obtained from X via a channel TZ: 

Y = Xo7Z. 

For these observables, we have 

X = Xono£. 

In this chapter, we will study in greater depth the class of observables which have 
this property, i.e. which are correctable. Correctable observables are rare compared 
to preserved observables, because they require the channel to have special symmetry 
properties. 

We will characterize classes of observables correctable by a common channel TZ, 
and show that they generalize the known quantum error correcting codes |26l |271 [28l 
[2ni EO]- Our main result is a generalization of the Knill-Laflamme conditions [27] 
to infinite-dimensional codes, and hybrid quantum-classical codes. Our approach 
is close in spirit to the first concept of noiseless subsystem introduced in [23] , 
and technically relies on later results on the correctability of noiseless subsystems 
[3T| [32] . Some of the new results presented were published in [H 121 [3]. 

4.1 Quantum error correction 

Let us suppose that some information is sent through a channel S. The aim of error 
correction is to find certain degrees of freedom, the error correcting code (whose 
exact nature we deliberately keep imprecise for the moment), on which the effect of 
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the channel can be inverted. Since the inversion must be implemented physically, 
it must be a valid physical transformation, i.e. a channel. The inverse channel TZ is 
called the correction channel. The fact that a channel S can always be written as 

£ip) = J2E,pE; 

i 

means that we can always assume that the noise is given by a discrete family of 
individual errors Ei. 

In fact we will see that if TZ corrects this channel on some code, then it would 
correct also any channel whose elements span the same operator space as the el- 
ements of S. This fact is important because often one does not know the precise 
channel elements Ei. Indeed, suppose that the system interacts continuously with 
its environment via a Hamiltonian H. The most general form of H would be 

H = '^J,0 Ki 

i 

where the interaction operators Ji act on the system, and the operators Ki act on 
the environment. 

Let us assume that the initial state of the environment is given by some vector 
It is unlikely that we know much about Ki or \ip) given that the environment 
may be very large and complex. However, it is generally conceivable that we have 
a good knowledge of what the operators Ji can be. If £t is the channel describing 
the evolution of the system alone up to time t, we have 

8;{A) = (1 ® (^|)e**^(A ® l)e-'*^(l ® |^)) 

= ^(1 ® (^|)e'*^(l ® \k))A{l (A;|)e-'*^(1 ® |^)) 

k 

= Y,El{t)AE,{t) 

k 

where the channel elements are 

Ej,{t) = {l®{k\)e-''''{l®m 

n 

n jl-jn 

Hence the channel elements Ek{t) all belong to the algebra generated by the inter- 
action operators Ji. At this point, we could try to find correctable codes for the 
class of channels whose elements span this algebra. This is possible, but would in 
fact result in codes which need no correction at all (i.e. noiseless subsystems [2^). 

However, if the time t at which we aim to perform the correction is small enough, 
we can do better. Indeed, suppose that A is some interaction parameter with unit 
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of energy, then the above series is expressed in powers of tX. We see that to the nth 
order in tX, the elements of the channel are in the span of the nth order products 
•Jji ' ' ' J in- Hence, if we correct often enough (in order to limit the value of tA), 
then we only need to find a correction channel and a code for channel elements in 
the span of the operators Jj-^ ' ' ' Jjni ^ < ^ i ^ fixed A^. In this context, the 
operators Jj can be seen as representing independent errors. The goal is to find a 
code as large as possible which corrects as many of these errors as possible. 

For clarity of the presentation we will stick to the simple picture where the 
channel £ is given. However we will keep the more realistic situation in mind and 
check that the correction procedure we devised works not just for the given channel, 
but also for any channel whose elements span the same space. 



4.2 Correctable observables 



Let us introduce a notion of correctability which fits our framework, and then 
later show how we recover, as a special case, the known results of quantum error 
correction. In this section we consider again a channel 

£ : BtiUi) ^ Bt{n2) 

Definition 4.1. We say that an observable X is correctable for £ if there is a 
channel 7^, called the correction channel, which is such that 

X = Xono£. 



We then say that X is fixed by the channel TZo £. 



The operational meaning of this definition is clear. If X is fixed by TZo £, then 
a measurement of X after the application of the channel £ followed by TZ will yield 
the same outcome as if nothing had happened to the system. 

It directly follows from this definition that if X is correctable, then it is also 
preserved. Indeed it suffices to define the observable Y := X o TZ to observe that 
X = Y o £. Preserved observables however are rarely correctable. Indeed, if X is 
correctable then, in particular, the effect X/||X|| is also correctable. This suffices to 
show, for instance, that none of the preserved effects are correctable in the examples 



shown in Figure 3.2, except for the case n = 2 where they are all correctable. 



We have already identified a class of preserved observables which are correctable, 

in 



namely the preserved sharp observables. Indeed, in order to prove Theorem 3.5 



Section 3.2, we introduced the channel defined by Equation 3.9, namely 

n*iA) = £,{l)-'^£,{A)£x{l)-'^ 



which was shown to correct all the preserved sharp observables. 
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Theorem 4.1. Given a channel 8 with elements Ei, and a sharp observable X 
represented by the self-adjoint operator X , the following conditions are all equivalent 

1. X is preserved by £, 

2. X is correctable for £, 

3. [X.EiEj] = for alii, j. 

Note the similarity between this proposition and the one obtained in [15], for 
an initially different concept of "preserved information" . 



4.2.1 Simultaneously correctable observables 

We know that, in fact, the sharp preserved observables can all be corrected with 
the same correction channel. Let us explicitly define this concept. 

Definition 4.2. A set of observables is said to be simultaneously correctable for 
S if the observables it contains are all correctable in terms of the same correction 
channel TZ, which means that they are all fixed hj TZo S. 

Before we go further, let us remark that, contrary to what happened for pre- 
served observables, the correctable observables are entirely characterized by the 
simultaneously correctable effects. Indeed, it is clear that an observable X is fixed 
by the channel if and only if its effects X*{a) are all fixed by JF. Indeed, by 
definition, X = X o JF if and only if X*{a) = J-'*{X*{a)). More generally, any 
set of observables having simultaneously correctable effects are simultaneously cor- 
rectable. 

Let us state what we already know: 
Proposition 4.2. The effects in the algebra 

A={Ae BiUi) I [A, E*Ej] = for all i, j] 



are all simultaneously correctable by the channel defined in Equation 3.9 



As noted above, this implies that the observables X with X*[a) G A are cor- 
rectable by the same channel TZ. We will call A the correctable algebra for £. This is 
justified by the fact that any other algebra spanned by correctable effects is inside 
A. Indeed, such an algebra would be spanned by its projectors, but A already 
contains all the correctable projectors. 

In general, a set of simultaneously preserved effects need not be characterized 
by an algebra. However, if a set of effects is simultaneously correctable, then so 
are any linear combinations which also form effects. Since, in addition, effects 
are self-adjoint, this means that a set of simultaneously correctable effects will 
always be characterized by an operator system S, i.e. a linear subspace S C B{TCi) 



Correctable observables 



65 



which is closed under the *-operation. Hence, if we say that an operator system 
S is correctable, we mean that all the effects in it are correctable. In fact, given 
that we only consider normal channels, then also any weak-* limit of such linear 
combinations will be correctable by the same correction channel. Hence we only 
need to consider weak-* closed operator systems. 

We will now show that many correctable operator systems can be obtained 



thank to Proposition |4.2[ Consider any subspace Ho C Hi, and let 

V -.Ho^Hi 

be the isometry which embeds Ho into Hi. This means that the operator Pq := 
VV*, defined on Hi, is the projector on the subspace Hq, whereas V*V = Iq is the 
identity inside Ho- We can define a new channel So by restricting the channel S to 
the subspace Hq, which, physically, amounts to making sure that the initial state 
is prepared inside Ho- Hence we define 

So{p) ■-= S{VpV*) 

whose dual is 

£*(A) = V*£*iA)V 
This channel has its own correctable algebra: 

Ao = {Ae B{Ho) I [A, V*ElEiV] = 0}. 

This algebra can be naturally embedded in B{Hi) via the isometry V- Indeed, 
the map A i-^ VAV* from B{Ho) to B(Hi) is a normal *-homomorphism. Note 
however that the identity on B(Ho) is sent to the projector VV* = P. 

The algebra \^v4o^* may or may not be a subalgebra of the correctable algebra 
A- If it is not, then its effects are not correctable for the channel S itself. However, 
we will see that it is one-to-one with a family of simultaneously correctable effects 
which do not form an algebra. Indeed, let TZo be the correction channel for So, 
which is a channel from Bt{H2) to BtiHo), and define the set 

So--=S*{ni{Ao)). (4.1) 

Note that So is not equal to because we have used £* instead of in Equation 



4.1[ In fact, we have 

Ao = V*SoV (4.2) 

This subset So C B(Hi) is in general not an algebra, but it is always an operator 
system. We claim that all the effects which are in So are simultaneously correctable. 
Indeed, for any effect A = S* {711(B)) G So, where B G we have 

s*{ni{v*AV)) = s*{n*Q{v*s*{ni{B))v)) 
= s*{ni{B)) 

= A. 
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Hence, the correction map is 

p ^ vno{p)v* 

which is a vahd channel from Bt{'H2) to BtiHi). 

Hence, for any subspace Tio C Tii we can construct an operator system Sq 
whose effects are all simultaneously correctable. In addition, it is clear that all the 
observables which are formed of effects in Sq are all simultaneously correctable. 

This proves the following: 
Theorem 4.3. For every subspace Ti^ C Tii, the operator system Sq defined in 



Equation 4-i is such that all the observables X with X*{a) G Sq for all a are 



simultaneously correctable. 

Let us summarize how to obtain Sq- We restricted the channel to the subspace 
Ho, computed the correctable algebra and the correction channel TZq for the 
restricted channel, and finally set Sq = S*(71q{Ao))- In fact we can be entirely 
explicit. Letting V be the isomorphism embedding Hq into Hi, P := VV* the 
projector on Ho, £\ the regularized channel defined on the whole of B(Hi), and 

K := (^a(P))-^ 
we have that the operator system 

So = {S*{KSx{VAV*)K) I A e B{Ho), [A, V*E*EjV] = for all i, j} 
is corrected on S by the channel 

n{p)=P£liKpK)P. 



We will give an explicit example in Section 4.4. 1[ 

Do these structures exhaust all the correctable observables for S7 Some light 
could be cast on this question thank to theorem 3 of |15], the proof of which has 
unfortunately not been published at the time of writing. The authors state that, 
when Hi is finite-dimensional, the fixed point set of the dual of any channel must 
be made of elements of the form A + J-"* (A) where A belongs to a *-algebra 
inside A4{Ho), Hq a subspace of T^i, and JF* is a fixed channel which is such that 
PJ='*{A)P = 0, where P projects onto Ho- 

If an operator system S is correctable for S, then it is fixed by S* o TZ* for some 
channel TZ, and, according to |45j, made of elements of the form mentioned above: 
A + J^*{A). This means that if V is the isometry embedding Ho into Hi, we have 
V*S*{n*{A + J^*{A)))V = V*{A + J^*{A))V = A, which implies that the algebra 
Ao is correctable for £ restricted to Hq, and therefore belongs to the correctable 
algebra for this restricted channel. This shows that S is of the form covered by 



Theorem |4.3[ which therefore exhausts all correctable observables, at least in finite 
dimension. 

In the next section we will see a reason why, for most applications, it is more 
useful to make sure that the initial state is in the subspace Ho and work only with 
the correctable algebra rather than working with the operator system So and 
no restriction on states. 
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4.2.2 Simultaneously correctable channels 



In section 4.1 we motivated a situation where, of the channel elements Ei, only 



their span is known. It is already clear that the correctable algebra 

A = {A\ [A, E*Ej] = for all i, j} 

only depends on the span of the elements Ei. Indeed, consider a channel S with 
elements Ei and a channel S' with elements Fi = Yljlij-^j where are arbitrary, 
provided that 'Y2iE*Fi = 1. Then it is clear that an operator which commutes 
with the products E*Ej for all i and j will also commute with operators F*Fj since 
they are just linear combinations of the former. This fact tells us that A is the 
correctable algebra for all channels whose elements are chosen in the span of the 
operators Ei. However this fact alone would not be very helpful if the correction 
channel itself depended on the particular choice of channel elements. Fortunately, 
it does not. 

In fact, we have already exploited part of this freedom in defining TZ. Indeed, 
remember that we defined it in terms of the map 



£x{A) = J2^^E,AE* 



defined on B{Hi) (see Equation 3.6). The sequence A was chosen so that the infinite 
sum in the expression for Sx is well defined for any operator. However the exact 
value of the components Aj did not matter in the proof that TZ corrects the algebra 
A for the channel S. In fact, the only important aspect of this channel was that its 
elements are linear combinations of the adjoints of the elements of S. 

The channel TZ' correcting S' would be defined in the same way in terms of the 
channel 



S',{A) = J2^^F^AF: 



= XiiijiikEjAEl 

ijk 

which has also the right form for the corresponding correction channel 

^'(p) = (^;r((^;(i))-^p(^;(i))-^) 

to correct the channel £. We will not go through the proof that TZ' corrects £ on 
A, since precisely the same steps can be followed as for 7Z itself. 

Hence we have seen that all the channels whose elements span the same space 
of operators will have the same correctable algebra, and be correctable through the 
same correction channel. This means that this theory can be applied to the case 



described in Section 4.1, where the span of the elements is all that we know about 
the channel. 
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There is a sense in which it is this fact which allows for the quantum errors to 
be understood as being discrete [35]. Indeed, a standard error model for quantum 
computing is that where the system considered is a tensor product of qubits, namely 
two-dimensional quantum systems. The possible "errors" (i.e. possible channel el- 
ements of the noise) are supposed to be any operator acting on no more than n 
subsystems, where n is fixed. It is clear that this set of errors is continuous. How- 
ever, for a finite number of qubits their span is separable (in fact finite-dimensional), 
which means that it suffices to choose a discrete set which spans the space and try 
to correct these only. 

This discussion applies to simultaneously correctable sets of observables charac- 
terized by an algebra, which are the correctable sharp observables. However, it does 
not apply to the classes of simultaneously correctable unsharp observables identified 
in the previous section. Indeed, in those cases the correctable operator systems Sq 
may be different for two channels whose elements span the same operator space. 
Indeed, remember that 

5o = r(7^s(A))• 

where is the correctable algebra for the channel restricted to a subspace TCq, 
and TZo the corresponding correction channel. Therefore, although both TZq and 
^0 would be the same for both channels, the set So in this expression depends 
explicitly of the action of the channel itself, and may be different in both cases. 



4.2.3 Nature of correctable channels 

The fact that A is the correctable algebra for the channel S, with correction channel 
TZ, implies that the map £* o TZ* acts simply as the identity on A: 

(r o7^*)U = id^ 

The following theorem elucidates what happens to the algebra A prior to its cor- 
rection: 

Theorem 4.4. Let A be the correctable algebra for a channel £. Then £* is a nor- 
mal *-homomorphism of the algebra generated by the pre-image of A. In particular, 
for any operators B, B' such that S*{B),S*{B') G A, we have 

S*{BB') = S*{B)S*{B'). 



Proof. Remember, that the projectors P in the correctable algebra A satisfy BEi = 
EiP for some operator B, and for all i (see Equation 3.4). It can be directly checked 
that this is also true of the span of the projectors, which is almost the whole of 
the algebra A, up to closure. Now consider two operators B and B' such that 
A := S*{B) and A' := S*{B') belong to the span of the projectors of A. We know 
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that BE, = E,A and B'Ei = EiA'. This imphes that BB'Ei = BEiA' = EiAA', 
from which it follows that 

S*{BB') = ^E* BB'Ei = = 

i i 

= £*{B)£*{B'). 

Since £* is weak-* continuous, this condition also applies to the weak-* closure of 
the set of operators whose images are in the span of the projectors of A. This shows 
that the above condition holds for every operators in the pre-image of A. □ 

Note that the pre-image of A under £* includes in particular the image of the 
dual of any correction channel IZ. 



In fact, the correction channel defined in Equation 3.9 is itself a homomorphism. 
We saw in the above proof that for all operators A in the span of the projectors of 
Ai we have TZ*{A)Ei = EiA. This implies that 

n*{A)£x{l)=Sx{A). 

which means explicitly 

{S,{l))-h,{A){S,{l))-^=S,{A). 

or, simply, 

[{S,{l))-KSx{A)] = 0. 

From the weak-* continuity of Sx, we have that this is true for all A & A. Using 
this fact, and also recalling that [A, E*Ej] = for all ^ G ^, we have 

n*{A)n*{A') = (£,(i))-i^,(A)(£:,(i))-^£,(A')(^A(i))-^ 

= {£,{1))-IS,{A)S,{A'){S,{1))-'^ 
= {S,{l)rlS,il)S,{AA'){S,{l))-'^ 
= {S,{l))-'^£,{AA'){£,{l)r'^ 

= n*{AA'). 



Hence, we proved the following proposition: 

Proposition 4.5. The correction channel given by Equation \3.9\ is a faithful rep- 
resentation of the correctable von Neumann algebra A. 



This shows that the effect of a channel on its correctable algebra simply amounts 
to representing it in a different way on the Hilbert space. These results clarify 
certain aspects of |17j. We refer to P] for the precise relationship between our 
results and those of |I7] . 
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4.3 Error correcting codes 



In Section we mentioned that the purpose of quantum error correction was 
to find a "code" on which the channel can be inverted, without defining what we 
meant by a code. In the previous section, we started from the general assumption 
that a code should be a set of simultaneously correctable observables. We have 
then found that for any subsystem Tio of the source Hilbert space Tii, we have a set 
of simultaneously correctable observables characterized by an operator system So, 
or equivalently by the von Neumann algebra which is such that = V*SoV 
where V is the isometry embedding TYq into Tii. The algebra characterizes the 
sharp observables correctable for the channel restricted to the subspace Hq. 

Hence, all the codes that we identified, on which the channel can be inverted, 
are, or correspond to, von Neumann algebras. In fact it easy to build abstract 
examples which yield any possible von Neumann algebra in this way, given that 
all von Neumann algebras can be defined as the commutant of an arbitrary set of 
operators. 

In order to understand the type of information represented by von Neumann 
algebras, we need to know how they look like. 



4.3.1 Structure of von Neumann algebras 

Let us first summarize the representation theory of finite-dimensional von Neumann 
algebras, which are just *-algebras. 

A concrete finite-dimensional *-algebra A, represented by matrices, i.e. opera- 
tors on a finite-dimensional Hilbert space, always has the form 



Af 



(4.3) 



k=l 



where A4nk denotes the full set of matrices on an n^-dimensional Hilbert space, 
and Imj, the identity on an mfc-dimensional Hilbert space. If the dimension of 
the algebra A is D then we have D = Ylk'^^l- '^^^ direct sum of two matrix 
algebras must be understood as the algebra of block-diagonal matrices, with one 
block encoding the first algebra, and the other block the second algebra. Therefore 
the above means that, written clS db matrix of blocks, 



'-mi 







A 



M 



"2 



'-m.2 








M 



riff 



In addition, tensoring a matrix algebra with the identity on another algebra means 
that we are considering matrices which are also block-diagonal, with as many blocks 
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as there are elements on the diagonal of the identity matrix, but such that each 
blocks are all identical, not only in their size, but also in their content. 

For instance, any operator A in the algebra A = {M.2 ® I2) © (-^3) has the 
form 

/B 0^ 
A = B 
\0 

for a two-by-two matrix B and a three-by-three matrix C. 

The block-diagonal structure of A is determined by the form of its center Z{A). 
The center is the set of operators inside the algebra which commute with all other 
elements of the algebra: 

Z{A) = {Ae A\[A,B] = Q for aWBeA }. 

It is a commutative algebra. The center can also be written as the intersection of 
the algebra with its commutant A! which is the algebra composed of all operators 
commuting with all elements of A: 

Z{A) =AnA'. 

For instance, for an algebra of the form Ain ® Im, we have 

Z{Mn ® Im) = (Mn ® Im) H (1„ O Mm) ~ C. 

A von Neumann algebra is said to be a factor if its center is isomorphic to C. Hence 
matrix algebras of the form A^„ Im are factors. 



More generally, if the representation of A is expressed as in Equation 4.3 , then 
the commutant is 



N 

= In, ® Mm, 
k=l 

and the center of A is 

N 

Z(^) = C(1„,®1„J (4.4) 

k=l 

which means that it is composed of diagonal matrices with only different eigen- 
values. If Pk is the projector on the kth block, then this means that a generic 
element C G Z{A) of the center is of the form 



for arbitrary complex numbers c^. The algebra A itself is block-diagonal in terms 
of the subspaces defines by the projectors Pk, in the sense that for all A E A, 



A = J2PkAPk. 
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Hence the center of the algebra essentially tells us what the blocks are in its repre- 
sentation. 

For instance, consider again the algebra A = {M.2 ® I2) © {-Ms)- Typical 
operators A & A, A' ^ A' and C E A have the form 














B 


• 














A = 5 A' = \ do dU C 





where a, b, c,d,x E C, B is any 2-by-2 matrix, and C any 3-by-3 matrix. 

When A is infinite-dimensional, the direct sum must be replaced by a direct 
integral. This follows from the fact that the center can be any commutative algebra, 
which has the form 

Z{A) ^L^{n) 

for some set Q equipped with a measure. It is with respect to this measure that we 
can write 

A^ A{x) dx 
Jq 

Where the generalized "blocks" A{x) are factors, i.e. have a trivial center. If Q 
is finite then we must use a discrete measure, which gives us the direct sum in 
Equation 4A Hence this integral can be intuitively understood as a continuous 
limit of the direct sum. 

Factors come in three main types. Up to now we have been using type I factors, 
which are always of the form B{T-C) for some Hilbert space 7i. Factors of type II 
or HI are more exotic. An example of a factor of type II will be studied in Section 



4.3.2 Standard codes and subsystem codes 

Traditionally, a quantum error correcting code is just a subspaces Tio of the initial 
finite-dimensional Hilbert space Tii, which is assumed to be finite-dimensional |35j . 
The idea is that the channel S is correctable for states in Tio if there is a channel 
71 such that 

mip)) = p 

for all states p which are mixtures of pure states in the subspace Hq. If we introduce 
the isometry V : Hq ^ Hi which embeds Hq into Hi, this means that 

n{S{VpV*)) = VpV* 

for all p G BtiHo), which is equivalent to requiring the existence of a channel 71' 
such that 

TZ'iSiVpV*)) = P 
for all p e Bt{Ho). Indeed, it suffices to pick 7^'(p) = V*7l{p)V. 
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If we define So{p) := £{VpV*), this means that IZ' o£q is the identity on Bt(Ho), 
or equivalently that S* o (7^')* is the identity on B{Hq), which, as we have shown 
matches our conception of correctabihty for the algebra B{T-Iq). Therefore we re- 
cover the framework of standard quantum error correction, for a code Tio, when 
the correctable algebra is BiTio), and the channel is restricted to the subspace Ho- 

In order to complete the comparison, let us check that our correctabihty con- 
dition reduces to the one introduced for standard codes ^27J. The Knill-Lafiamme 
condition states that a standard code represented by the subspace T^o is correctable 
for the channel £ with elements Ei if there exists \ij G C such that 

V*E*EjV = Xijl for all z, j (4.5) 

where V embeds Tio into the source Hilbert space Tii. In our framework, the 
correctable algebra must be precisely the commutant of the operators V*E*EjV 
for all i and j. Since here they are all proportional to the identity on TYq, the 
correctable algebra is indeed the whole algebra of operators on T-Cq. 

A more general framework was also introduced which generalized the notion of a 
code to that of a subsystem code [3ll|32]. In this approach one defines a code through 
a subspace Ho C Tii and a particular subsystem decomposition Ha ®Hb = Ho of 
this subspace. Again, let V be the isometry embedding Ho into Hi. We then say 
that the subsystem Ha is a correctable code if there is a channel TZ such that 

n{S{v{p®T)v*)) = p(^T' 

for any states p G BtiH-A), t,t' G BtiHs)- We want to show that this is equivalent 
to the case where the correctable algebra A, in our framework, is any factor of type 
I, which in this case is 

A = B{Ha) ® 1a. 

That is, assuming that we are restricting the initial state to the subspace Hq. In 
our language, this would mean that 

V*£*{n\X ®1))V = X ®1 (4.6) 

for all X G B(Ha)- Indeed, suppose first that Ha is a subsystem code corrected by 
TZ , then we have that for all X G B{Ha), 

TT{V*S*{n*{X ® l))V{p ® a)) = Tr((X ® l)n{S{Vp ® aV*))) 

= Tr(Xp O r) = Tr(Xp)Tr(r) 
= Tr(Xp) = Tr((X®l)(p®a)). 

This is true for all states p G BtiH-A) and all states a G BiHtiHs))- By linearity 
it follows that V*£*{X ® 1)1^ = X ® 1 for all X G B{Ha)- Conversely, if Equation 



4.6 is true for all X, then for all p G B(Ho) we have 

Ti{XTiB{n{£{VpV*)))) = Tr((X® l)7^(^(Vpy*))) 

= TT{v*s*{n*{x ®i))Vp) 

= Tr((X ® l)p) 
= Tr(XTrB(p)). 
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Since the above equation is true for all X, we have Ti b(JI{S (V pV*))) = Ttb{p) for 
all p G BtiHo), which was shown in |32] to be equivalent to the definition of Ha 
being a noiseless subsystem for S. 

In this framework, the correctability condition reads |3T] 



V*E*EjV = l(^Aij (4.7) 

for an arbitrary set of operators Ajj G B{T-Cb)- This means that the operators 
V*E*EjV for all i and j generate the sub-algebra 1 ® B{T-Cb) of B{Ho), whose 
commutant is indeed B{Ha) ® 1: the correctable algebra defining the subsystem 
code. 

Note that a subsystem code can always also be identified as a standard code, 
provided that we can afford to put stronger constraints on the initial state. Indeed, 
consider the smaller subspace Ti^ formed by the states inside TIq which are of the 
form I'?/') ® |0o), where |0o) is fixed. This subspace is associated with the isometry 

= ® |0o), for which we have 

W*ElE,W={UhiM^)l 

which is just the Knill-Laflamme condition for TYq. 

These results show that the standard codes, as well as the subsystems codes, 
correspond to the case where our correctable algebra is a finite-dimensional factor, 
which is always of type I. Our results yield two types of generalization over these 
codes. Firstly, we obtain a characterization of infinite-dimensional quantum codes 



and continuous classical codes, which will be discussed briefly in Section 4.3.4 In 
addition, we can correct information which is not quantum nor classical, i.e. which 
is represented by an algebra which is not commutative nor a factor. 



4.3.3 Hybrid codes 

We have seen that in our framework, the structure to be corrected, i.e. representing 
the code, can be any von Neumann algebra. A general von Neumann algebra with 
center Z{A) = L°°{Q) is of the form 

/•e 

A= A{x)dx (4.8) 
Jn 

where each A{x) is a factor. If the center is maximal, i.e. Z{A) = A, then the 
algebra A is commutative and each factor A{x) is of dimension one, i.e. isomorphic 
to the complex numbers C. If, on the other hand, the center is minimal, i.e. 
Z{A) ~ C, then the set Q contains only a single element xq and A = A{xo) is a 
factor. 

If A is commutative, then it represents a classical system, which is clear from 
the fact that it has the form L°°{Q). It is then natural to say that if it is a factor, 
it represents a "pure" quantum system. 
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A physical system represented by an algebra, whose structure is given by the 



general form |4.8[ can be understood as being partly quantum and partly classi- 
cal. Indeed, we can consider the center Z{A) = as representing a classical 
system. For each possible "state" x & fl of this classical system, we have a pure 
quantum system represented by the factor A{x). 

For instance, a classical system, represented by L°°(f2), next to a type I quantum 
system, with algebra B{Tt), is represented by 



A = ® B{n) - / A{x)dx 

Jn 



where each factor A{x) is a copy of BiTi). In the more general case however, the 
size and type of the algebra A{x) may depend upon x. 

Generic operators A^B E A{x) dx are of the form 

A= / A^dx and B= / B^dx (4.9) 
Jn Jn 

Where Ax, B^ G A{x) for all x E Q. Their product is simply 

AB = AxBx dx. 
Jn 

An element of the center is of the form 

C = a{x)lx dx 
Jn 

where a G L°°{Q) and 1^. is the identity on Ax- 



If A in Equation 4.9 is an effect, then each operator Ax is also an effect which 
can be interpreted as a quantum proposition which is true conditionally on the 
classical system being in state x. 

If the finite-dimensional case, if a hybrid algebra 

k 

is correctable, then each factor Ak = A^n^ ® Im^ represents a correctable subsystem 
code for states restricted to the subspace Tik projected onto by = Ink ® Imk- 
Indeed, if V is the isometry corresponding to Pk, then VAV* is the representation 
inside A of an operator A G Ak- Therefore 

V*£*{n*{VAV*))V = V*VAV*V = A. 



Therefore, a finite-dimensional hybrid algebra can be understood as representing a 
family of orthogonal subsystem codes correctable simultaneously. 
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4.3.4 Infinite-dimensional subsystem codes 

The derivation of necessary and sufficient conditions for error correction of infinite- 
dimensional algebras is an interesting new result, given that all physical systems 
are naturally modeled by infinite-dimensional systems. In particular, it yields a 
formulation of quantum error correction for systems characterized by continuous 
variables 



A code can be said to be purely quantum if it is represented by an algebra 



which is a factor. In the finite-dimensional case, we have seen in Section 4.3.2 that 



factors represent subsystem codes characterized by Equation |4.7[ or Equation |4.5 
Some authors assumed that this condition would hold unchanged in the infinite- 
dimensional case. For instance, in [IHI the Knill-Laflamme condition was expressed 
for a channel with continuous elements E^^ as 



V*E:EyV = \{x,y)l 

where X{x,y) G C. Our results show that this condition is sufficient. Indeed, it 
implies that the commutant of the operators V*E*EyV for all x and y is the whole 
algebra J^(Ho) on the subspace Ho C Hi. However, in infinite dimension this con- 
dition is no longer necessary since it expresses that the code must be isomorphic 
to A4(Ho), which is a factor of type I. Hence it misses the possibility of correcting 
more general factors. This means that our generalization of quantum error cor- 
rection to infinite-dimensional systems introduces new types of quantum codes not 
previously considered, namely factors of type II and HI. An example of correctable 



type II factor is given in Section 4.4.4 below 



4.4 Examples 

Let us conclude this chapter by giving a few examples demonstrating features of 
the new type of codes that we obtained. 



4.4.1 Operator system 

We can easily construct examples of correctable operator systems which are not 
algebras by taking any algebraic code correctable on a subspace Ho C Hi, and 
using the correction channel to "lift" it to an operator system correctable without 



restriction on states, as explained in Section 4.2.1 As an example, we will take the 
simplest standard code which corrects any random bit flip. This code is defined 
by a two-dimensional subspace Ho ~ of a three-qubit Hilbert system Hi. It 
is chosen such that the whole algebra of operators on the subspace is correctable 
for any channel with elements in the span of the "error operators" {l,Xi,X2,X3} 
where Xi is a Pauli matrix acting on the ith qubit. The subspace Hq is defined by 
the isometry 

V\i) = 
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where i e {0, 1} and we wrote — \i) (8) \i) (8) We also define 

P ■= VV* = |000)(000| + |111)(111|. 
It is easy to check that a correction channel for this code is 

n\A) = VAV* + J2 XiVAV*Xi 

i 

i.e., V*S*{TZ*{A))V = A for all A e MiHo). 

The algebra correctable on the code Ho, as seen embedded in B{Hi) is 

VAoV* = VM{no)V* = {^a,,|m)Oji| : a^j e c} 

In order to proceed, we need a specific error channel. We will pick one which 
corresponds to assigning a probability for the occurrence of each error in the set 
{l,Xi,X2, Xs}: 

^ (p) = Pop + ^PiXipXi. 

i 

Writing Xq := 1 for convenience, the operator system 

1 3 

5o = S*{TV{Ao)) = {Y. PkXkXi\tii){jjj\XiX^, : a,, G c} 

i,j=Q k,l=0 

is correctable by TZ on all states. We should have PSqP — VAqV*. This can be 
seen from the fact that \iii) — P\iii) and PXkXiP — 5kiP- Explicitly separating 
the components respectively inside F^o^* and orthogonal to y^o^^* we have 

So = +YpkXkXi\iii){jjj\XiXk^ : aij e c|. 

4.4.2 Classical channels 

Formally, we only derived our results for quantum channels. However we can deduce 
what happens in the case of a channel between two classical systems. For simplicity, 
let us consider the finite-dimensional case. A classical channel 

TT : L(Oi) ^ L(02). 

can be represented by a quantum channel 

£ : MiL^n^)) ^ MiL^{n2)) 

defined by 

^(P) = Y^ij{j\p\j)\^){i\- 

ij 
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where 

i 

This means that the channel elements are 

In order to make sure that our scheme is entirely classical, we want to check that 
the correction channel TZ is classical. In finite dimension, it can be constructed as 

n{p)=£*{KpK) 

where K :— (£^(1))~^. Clearly K is diagonal in our basis \i), and £* maps diagonal 
operators to diagonal operators. Therefore TZ represents a classical channel. 

In order to find the correctable algebra, note that 

A classical effect a — a}^^k){k\ is correctable if and only if it commutes with all 
these operators, i.e. 

^^/ThjThiak\k){k\j){l\ = ^^TiijTiiiak\j){l\k){k\ ^ij^l 

k k 

<S=^ aj'Kij'Kii\j){l\ ^ ai'Kij'Kii\j){l\ Vi,j,/ 

<(=^ (aj - ai)'Kij'Kii = Vi, j, I 



Hence, the effect a can be corrected if and only if aj — for all the states A;, j 
which are such that there exists i with tTj^ ^ and TTjfc ^ 0. This simply means 
that two states cannot be distinguished from each other when there is a nonzero 
probability for a transition from a common state to both of them. 

Let us see what the correction channel does explicitly. First, note that 
X= (£(l))-i = E7r.,K)(^ri = $^E7r.,)-^K)(^| 

ij i j 

where the sum over i is restricted to the the terms which are such that Tiij ^ 
0. Let us define Ki :— Ylij'^ij- The correction channel on a classical state p — 

i ^3^^^ ik 
Expressed as a stochastic matrix, it has components 
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Note that the stochastic matrix element irji is the probabihty that the channel vr 
would output the state j if the input was i. We see that this correction channel does 
the following: if it is fed with the state j, it randomly outputs any of the states i for 
which TTji 7^ 0. Indeed, it has no way of knowing which one of these was the initial 
state. However it knows that it could not have been any other state. Clearly the 
correction works exactly only if there is no ambiguity, i.e. if j could have come only 
from a single state i. This is why the correctable observables are those which do not 
distinguish between two states which have a non-zero probability of transitioning 
to the same output state j. 



4.4.3 Failed teleportation 

We can view the standard quantum teleportation protocol [50j as an example of 
quantum error correction. Indeed, Bob must find a way to reconstruct Alice's quan- 
tum information from the classical bits she provides, and his half of the entangled 
pair. 

Here we will use our framework to show that if some of the classical information 
gets lost in the way. Bob will only be able to reconstruct a hybrid. 

For teleportation to be possible. Bob and Alice must initially share a pair of 
entangled qubits. This pair is described by the state 

IV') = ^(|o)®|o) + |i)®|i)). 

Beside her entangled qubit, Alice possesses an other qubit in an arbitrary state p 
that she wants to transmit to Bob. The point of the protocol is that she is not 
allowed to send Bob any quantum information. This means that all she can do is 
to perform some measurement on her pair of qubits, and send the classical result 



to Bob (see Figure 4.1). 



The measurement that she must perform is the one defined by the basis of four 
orthogonal maximally entangled states, which can be obtain by applying the Pauli 
matrices to one half of {ip). Hence the observable's elements are 

X, = (1 ® u^mmi ® u*) = \^,){^^\ 

where Uq = 1 and Ui = ai for i G {1,2,3}. Note that U* = Ui. One can check 
by direct calculation that — 1- These states are all maximally entangled 

because they are obtained by local transformations of a maximally entangled state. 
The key property of these states is the following: 

\iji) = {l®Ui)\ij) = {U,®l)\ij). 



The channel from Alice to Bob is: 

£:(p) = (x®id)(p®|^)(7A|) 
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Bob " 



Alice 




Alice 



Figure 4.1: Left: standard teleportation circuit. Time runs from bottom to top. 
The input is Alice's quantum state to be teleported. The first half disk is a maxi- 
mally entangled Bell state of two qubits. The second half disk represents a measure- 
ment in the Bell basis, the output of which is classical information (the outcome of 
the measurement). This classical information together with the second entangled 
qubit are given to Bob who decodes the information by applying a unitary trans- 
form on the qubits, conditioned on the classical information. Right: what should 
Bob do, and how much of the quantum information can he recover, if some of the 
classical information is lost in the way? 
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where id is the identity channel on Bob's half of To expand this, note that if 

we represent the classical target system of X in terms of diagonal quantum states, 
we have, for an arbitrary state p' of Alice's couple of qubits, 

x{p') = J2'^r{x,p')\t){t\ = J2 \^)mp'\m\ 

i i 

Therefore, 

^(p) = ® ^)(p®\^){m\A){i\ ® 1) 

i 

ijk 



2 

ijk 



ijk 

^ i {n\UipUi\rn){n\j){k\rn) ® \j){k\ 

ijknm 

-\Y.WipUi\k) \^{i\®\3){k\ 

ijk 

\ E 10(^1 ® \3){mpUi\k){k\ = \y. ® ^^p^- 



4 

ijk i 

Once Bob receives this state, it is clear that he just has to measure the classical 
system, and apply the inverse of the corresponding unitary transformation in order 
to recover the state p. 

From the point of view of quantum error correction, we can see that the qubit 
can indeed be recovered by computing the operators E*Ej where 

Ei = ]^\i)®Ui. 

Indeed, we have 

ElEj oc ® Ulu, = 5,,ulUi = 

This shows that the whole algebra of operators on the qubit is correctable. 

Note that we can teleport n qubits simply by applying this protocol to each 
qubits in parallel, which requires also n entangled pairs shared between Alice and 
Bob. In this case the channel from Alice to Bob is 

(") ^tM 
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where p is now an n qubits state, and f/j, i = 0, . . . , 4" — 1 are all possible tensor 
products of Pauli operators and identity operators. 

An interesting aspect of this teleportation protocol is that, in a way, the quan- 
tum state to be transmitted is entirely encoded in the classical information being 
transmitted. More precisely, n qubits seem to be encoded in 2n classical bits. But 
how is this encoding done? We will try to clarify this question by answering a more 
precise one: what aspects of the quantum information do we loose if we destroy 
some of the classical information? 

The most general way of destroying some of the classical information transmit- 
ted from Alice to Bob consists in applying a classical channel vr to it (see Figure 



4.1). The channel elements of the classical channel are (see Section 4.4.2) 



so that the overall channel from Alice to Bob has elements 

The relevant operators for determining the correctable sharp observables are, drop- 
ping the unimportant factors |, 

Therefore, the correctable sharp effects are those commuting with the operators 



for all i, j, k. 



An effect may commute with one of these operators for one of two reasons. 
Either it is invariant under the corresponding transformation, or 



Remember from the classical error correction example (Section 4.4.2), that this con- 
dition on vr means that the classical state j cannot be distinguished from the state 
k because of the noise. Hence, whenever the noise erases the distinction between 
two classical states j and k, the observables that Bob can recover must be invariant 
under the unitary transformation Uj"'''uj[^\ This result can be understood as im- 
plying that the classical information encodes how Bob's qubit must be transformed 
in order to recover Alice's state, which is in fact precisely how Bob effectively inter- 
prets it. If the information about which transformation to apply is lost, then the 
observables which can be recovered are those which do not distinguish between the 
outcomes of the two transformations. 
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For instance, in the case n — 1, suppose that we loose the distinction between 
j = and k = 3, which corresponds to the unitary operators 1 and (T^. The 
correctable observables must then commute with ct^, which implies that they must 
be diagonal. Therefore they only represent one single bit of information from Alice's 
qubit. However, if we also loose the distinction between the states 1 and 2, the 
correctable information is exactly the same. Indeed, the new symmetry that must 
be imposed in this case is a^o-y oc (7^. This shows that our error correction technique 
misses some information in the former case. 

Recall that the correctable information is characterized by the sharp preserved 
observables. In order to see what happens in more detail, we would need to look 
at the preserved unsharp observables. For simplicity, let us simply consider the 
preserved unsharp effects. Note that an effect of the hybrid quantum-classical 
system of Bob is of the form 

B = Y,(^S){^\®Bi. 

i 

Its image in the Heisenberg picture is 

We can always choose /5i = /32 = 1, and Bi and B2 in order to obtain any effect for 
the second and third term. However, the term 

Bq + azBoaz = 2^ \i){i\Bo\i){i\ 

i 

is always diagonal. Therefore we can write all the preserved effects as 

A^-D+-B 
2 2 

where B is an arbitrary effect, and D is any diagonal effect. The further loss of 
the distinction between the states 1 and 2 would also force the second terms to be 
diagonal, so that only diagonal effects are then preserved. 



4.4.4 Irrational rotation algebra 

Consider the algebra generated by two elements x and p satisfying the canonical 
commutation relations 

[x,p] = il. 

This algebra can be represented onTi = L^(]R), where the position operator x acts 
on a function ip e L^(M) as {xip){x) = xip{x) and the momentum p as {pip){x) = 
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Suppose that this system interacts with an environment through a Hamiltonian 
of the form H = J2i Ji ® Ki, where the operators Ji act on the system, and the 



operators Ki on the environment. We have seen in Section [4yT] that this imphes that 
the channel elements of the resulting channel on the system belong to the algebra 
generated by the operators Jj. 

We will now suppose that the interaction operators Jj are of two forms. Some 
are periodic functions of x, with period L^,, and others are periodic functions of p, of 
period Lp. This implies that these functions are linear combinations of powers of the 

functions x ^— e*^^ or e^p^ respectively (their discrete Fourier components). 
For convenience, let us define 



2ti , 27r 

f^x := y- and Up := — 



P 



The von Neumann algebra generated by the interaction operators is also generated 
by the two unitary operators 

U = e'^^^ and V = e'^^^^ 



In addition, we assume that the real number 

■~ 27r 

is irrational. This number is important because it enters into the commutator of U 
and V: 



In principle, in order to find the correctable algebra, we need to find the op- 
erators commuting with the products E*Ej of the channel elements Ei. However, 
since we only know the span of these operators, we cannot exclude that E^ = 1 
for some i. If this is the case, then these products include E^ and E* for all i. 
Therefore, for an effects to be correctable, it needs to be in the commutant of the 
von-Neumann algebra generated by the operators Ei for all i, which is the same as 
the von Neumann algebra generated by the interaction operators Jj. 

The operators 

U' = e'^^ and V' = e'^^ 
commute with both U and V. To see that U' commutes with V, simply note that: 

U'V = e'^VU' = e^'-^VU' = VU' . 

Similarly, V also commutes with both U and V . In fact, the von Neumann algebra 
generated by U' and V is the whole commutant of the algebra generated by the 
interaction operators Jj [SI]. In addition, it happens to be a factor of type II, and, 
together, with its commutant they generate the whole of B{TC). 
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Therefore, this is an example of a correctable factor of type II. In fact, it is also 
noiseless [23] , in the sense that the correction channel can be taken to be the identity 
channel, i.e. no active correction is needed. This happens simply because 1 was 
assumed to be among the channel elements. Indeed, we saw that it implied that 
the correction operators had to commute with the channel elements themselves. 
Hence, if S is the channel, we have S*{A) = S*{1)A = A for all elements A of the 
correctable algebra. 

Let us see how we can understand this "type II subsystem" , and how it resem- 
bles, and differs from, the factors of type I with which we are familiar. If we were 
dealing with a factor of type I containing the identity, then the Hilbert space would 
take the form H = Hi®H2, so that our algebra would be simply B(Hi) (S) 1. In this 
sense, a factor of type I defines a subsystem in the traditional sense. For instance, 
consider H = L'^{R^) = L'^{R) L'^{R). If ^ e L'^{R'^), the operators in the first 
factor A = i3(L^(M)) 1 are those which act only on the first component of ip. For 
instance, the first factor is generated by the operators (£■?/') (x, = xip^x.y) and 
{p'ip){x,y) = -^ip^x^y). Note that here the set on which the states are defined 
can be understood to be the set of joint eigenvalues to the position operators in A 
and A!. 

Something similar happens for our factor of type II. Let A be the factor gen- 
erated by U and V , and A! its commutant, which is generated by U' and V . Let 
us see if we could see the elements of Ti as wavefunctions over the eigenvalues of 
U and U' . First, note that the spectrum of both these operators is a circle (repre- 
sented in the complex plane). These two operators being functions of the position 
operator x, we may want to use the fact that the states of Ti can be represented 
as wavefunction over the spectrum of x, i.e. elements of L^(]R). Indeed, we can 
naturally convert an eigenvalue x of x into the eigenvalues 

a = e^'^^^ and 6 = e^"^^ (4.10) 

respectively of U and U' . In fact, this relationship is invertible. Indeed, if we are 
given a and 6, then only a single real number x will satisfy both these relations. 
Indeed, suppose that we had two different real numbers x and x' yielding the same 
values of a and h. This would imply that they are related by x — x' = 27171 /uj = 
27rm6/LJ for two integers n and m. But this would imply 6 = n/m, which is 
not possible since we assumed 6 to be irrational. This implies that for each state 
ip e L^(]R), we can define the function 

ipia, h) := ipi^x) 



where x is the unique real number related to a and h via the Equations 4.10 Note 



that this function i}) is defined only on the valid couples (a, h) related to some x G 



via Equation 4.10 However, due to the irrationality of 6', these couples are dense 
in the unit torus. We can therefore think of as being defined almost everywhere 
on the torus. We will indeed see that it can be interpreted, in a suitable sense, as 
the wavefunction of a particle on a two-dimensional torus. 
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The relation between the wavefunctions '^{a,h) and the factors A and A! is 
given by the fact that they "act" respectively on the first and second arguments of 
■0 respectively. Indeed, we have 

which means that it acts just like the first component of the position of the particle. 
Similarly, 

(C/V)(a,6) = = e'^'^^ix) = b^{a,b) 

The action of V is also easy to compute: 

iVip){a,b) = {Vij){x)^ij{x + u}p)^ij{ae^''''^,b). 

Hence, its effect is to rotate the first argument by the irrational angle 9. Similarly, 

{V'i^){a,b) - {V'ip){x) ^ ip{x + Up/e) ^ ip{a,be^'''^). 

Although it looks like a particle on the torus, this system differs from it by the 
nature of the normalized states. Indeed, the norm is 

Uf^ J \iP{x)\^dx^ J |V^(e''^=^^e''^^)|'dx. 

What we have done is to take a standard particle in a one- dimensional space, and 
wrap its space around a torus in a dense trajectory. If we view the particle as a 
wavefunction tjj on the torus, its norm is an integral over this dense path. This 
picture illustrates what the noise does. It disturbs only the first component of the 
position of this particle, but not the second. 



Chapter 5 
Decoherence 



In this chapter we present three different models of decoherence. The third one 
being, in a way, a combination of the first two. The first model (Section 5.2) 
focuses on sharp observables, and is essentially based on our results related to 
quantum error correction. It will serve as a guide for our intuition as to what 
happens in the more general case involving unsharp observables. The second model 
(Section 5.4) focuses on the system itself, ignoring the environment, and defines 
the notion of an unsharp pointer observable. The third, more complete picture 
(Section 5.5) considers the exchange of unsharp information between the system 
and the environment. These results were presented in 

Before we describe these models, we will come back on a point made in the 
introduction, and define what we mean by a classical limit. 



5.1 Nature of a classical limit 

In nature, a classical system is a quantum system which happens to be in a context 
where its behaviour can be successfully modeled by a classical theory. In principle, 
neglecting computational difficulties, we ought to be able to model this system's 
behaviour using its full quantum description as well. This implies that we should 
have in principle two different models which both describe the same physical system: 
a quantum one, specified by an algebra of the form B(H), and a classical one, defined 
by a commutative algebra L°^[Q). Since the quantum theory is more fundamental, 
we should be able to translate every propositions of the classical model (i.e. every 
effects) into a proposition in the underlying quantum language. This means that 
we should have a map of the form 

r* : L°°(fi) ^ Bin) 

which maps effects to effects. In addition, we may expect that a certain knowledge 
about the quantum system should induce back a knowledge about the effective 



classical propositions. We followed the same arguments in Section |2.1.2| when we 
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introduced observables, which yielded the requirement that the map must be hnear, 
positive, unital and normal. The map F* is the dual of a channel F which maps 
quantum states into classical states, just like an observable: 

Hence, we expect that a classical limit, at the kinematical level, is given by 
an observable. Our model of decoherence will take quite literally the idea that a 
classical limit emerges from a "measurement" of the system by the environment. 
In those terms, F is simply the observable being measured. We will refer to F as 
the pointer observable. 



5.1.1 Quantization 

We know that F does not only define a map from classical effects to quantum effects, 
but more generally a map from classical observables to quantum observables. Hence 
it can be understood as defining, in a somewhat restricted sense, a quantization 
procedure. 

In its most general form, a classical observable on a classical system defined by 
the algebra L°°(fi) is a channel 



TT 



For instance, suppose that = is the phase-space for a single continuous vari- 
able system (say a single particle in a one-dimensional space). Then the position 
observable is the map which sends a probability distribution fj^{q,p) over the phase- 



space to its marginal /i' = / IJ^{q,p) dp (see also Figure 2.1 for the action of the 
dual map). But observables need not have only scalar values. For instance, there 
is a phase-space observable, which is simply the identity map on L^{Q). Clearly 
this "phase-space" observable always exists and has a special status: it gives full 
information about the classical system. 



Remember (from Section 2.1.5) that a quantum channel 8 can be used indirectly 
to map an observable X of the destination to the observable Y = XoS of the source. 
The use of the dual map S* is implicit in this expression. Dualizing the equation 
we obtain Y* = S* o X*, which means Y*{a) = S*{X*{a)) for each proposition a. 
The expression Y = X o S is convenient because it expresses in a straightforward 
way that observing Y on the source is done by first applying the channel S and 
then observing X. 

The pointer observable F maps quantum states into classical states, and can 
therefore be used to map a classical observable vr into a quantum observable X via 
the equation X = vr o F. This means that measuring X amounts to first measuring 
the observable represented by F, and then measuring the classical observable vr on 
the classical outcome of F. Equivalently we have X*{a) = F*(7r*(a)). Of course. 
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r itself is trivially the image of the special classical observable represented by the 
identity on Q: F = id o F. 

If two classical observables are mapped to two quantum observables which do 
not commute, then the classical observable representing their joint measurement is 
mapped to an unsharp quantum observable. For instance, the classical observable 
"id" mentioned above, which represents the joint measurement of all classical phase- 
space variables, is mapped to F. 

Note that although F maps any classical observable to a quantum one, and every 
quantum state to a classical one, it does not in general attain all classical states. 
Indeed, the classical states that it can represent by quantum states will always 
satisfy all the uncertainty relations. This is where the "limit h 0" is involved. It 
comes as a second step which consists in artificially removing these constraints on 
states, hence assuming that the classical observations are too inaccurate to notice 
them. 

Let us study this through a standard example. Consider the case of a single 
free electromagnetic mode. This quantum system is characterized by the Hilbert 
space Ti = L^(]R), and its classical limit by a phase-space i7 = M^. We will denote 
the points of phase space by pairs z = {p, q) where q will be called the "position" 
and p the "momentum" . 

In many physically important situations, we know that the classical pure states 
characterized by canonically conjugate pairs z = {q,p) & correspond to the 
quantum coherent states \z). 

Coherent states are pure states in the Hilbert space H — L^(R) which form a 
complete set, i.e. 



These states however are not orthogonal and do not form a basis. For our purpose, 
we need only say that they are generated by acting on a fiducial vector \iJjq) with 
elements of the group of "translations" in phase-space: 

\z){z\^U,\0){0\U: 

where, using the quantum operators g and p satisfying [q,p\ = ih, we defined 

This group is a representation of the group of translations and "boosts" (i.e. trans- 
lations in momentum). Note that this set of unitary operators does not close into 
a group if we consider their action on vectors. Indeed, a non-trivial phase may 
pop up due to the non-commutativity of p and g. However this phase is eliminated 
when the action on mixed states is considered, which is all that matters here. 




where 



dz :— 



dpdq 



2'Kh ' 



90 



Decoherence 



In our language, this classical limit is defined by the observable F, which maps 
a quantum state p to the classical state := r(p) defined by 

fi{z) = {z\p\z). 

The effect of the dual map F* is to send the classical effect a e L°°{fl) to the 
quantum effect 

T*{a) = / a{z)\z){z\dz. 
Jq 

We can now play with this map and see how it transforms observables or states. 

For instance consider the classical position observable Q, which maps a phase- 
space probability distribution li{q,p) to its marginal ii{p) — J fi{q,p)dp. Its quan- 
tum version, Q o T, maps a quantum state p to the marginal of the distribution 
l^{(liP) = {QjP\p\QjP)- Its dual , which operates on effects a e L°°(M), is given by 



{QoVna) 



a{q)\p, q){p,q\dz 



J oi{q) J \p,q){p,q\ 
a{q)Qg dq 



dp 
2mh 



dq 



where we have defined the "effect density" 

Qq ■= J \p,q){p,' 



I dp 



These operators are, in fact, simple functions of the sharp position operator q. 
Indeed, using the position representation, we have 

{x\0,p) = (x|e-^Vo> = e-iP^o{x) = e-iP^^M^). 
which we can use to compute 

dp 



Qq 



\P,Q){P:Q\ 



27rh 



/dp 
\x){x\p, q){p, q\x'){x'\^^dxdx' 

n dp 



{x\p, q)(j), q\x' 



J j {.X- g|p,0)(p,0|a;' - q) 



x){x'\ dxdx' 
dp 



27rh 



x){x'\ dx dx' 



g-ip(x-x') 



dp 
2nh 



{x — q\ipo) {ipo\x' — q) \x){x'\ dxdx' 



J{^- Q\'^o){ipo\x - q)\x){x\ dx = fg{x) 
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where := Kxle-t^^lV'o)!'- 



If the fiducial vector ipoi^) is locahzed at the origin, then we see that the ob- 
servable QoT, characterized by the operators Qg which play the role of continuous 
POVM elements, is an approximate version of the sharp position observable repre- 
sented by g. In fact, it is likely that the set of operators Qq for all q generate the 
algebra generated by q, which essentially identifies q. Of course the same is true 
for the quantization P o F of the classical momentum observable P. 

More generally, consider a sharp real classical observable tt/ represented by a 
real (possibly unbounded) function / on the phase-space R^. The channel tt/ is 



where a is an effect in L°°(R). When composed with F* it becomes the unsharp 
quantum observable Xf with dual 



It is clear that any classical observable can be given a quantum version without 
any "ordering ambiguity" . 

Of com^sc, calling this "quantization" is far from being fair, given that we only 
reinterpreted a known quantum theory with a known classical limit. However, 
the fact that most classical observables, even though they have no sharp quantum 
representation, must have an unsharp one is an important point to make. Also, 
we point out that when only the classical description of a system is known, there 
is a priori no reason to think that certain specific observables will have a sharp 
representation in the quantum theory. That is, unless one is guided by some specific 
experimental results involving, for instance, the quantization of the value of certain 
observable, or some uncertainty relations. Such results are lacking in the case of 
general relativity. 

In addition, we note that, beside the choice of the dimension of our Hilbert 
space, or the type of our von Neumann algebra, the map F really defines the 
quantum theory, given that it is what makes the link between abstract quantum 
observables, which are nothing more than pure mathematical objects, and concrete 
classical observable that we know how to measure and interpret. As we showed, 
specifying F amounts to giving the quantization of all observables. In fact F itself 
is the quantization of the phase-space observable, which represents all that there is 
to know about the classical theory, apart from the dynamics. 

In the rest of this chapter we will attempt to give a realistic picture of the 
emergence of a classical limit defined by an observable F. The hope being that this 
provides clues as to how, given the classical theory and its environment, one may 
attempt to guess the nature of the map F which essentially defines the quantum 
theory. 



defined by 



{n}{a)){z) ^ a{f{z)) 
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5.2 Sharp decoherence 

The results presented in Chapter [3] give us a thorough understanding of the "sharp 
information" preserved by a quantum channel, i.e. the information represented by 
sharp observables. We will now make use of these results to generalize the simple 



example of decoherence discussed in the introduction, in Section 1.3 Our analysis 



will be based essentially on the results presented in Section [33 

We have seen that, if we are given a quantum channel S which describes the 
evolution of an open quantum system, there is a complementary channel Sc which 
describes the flow of information to the environment. The complementary channel 
is unique up to a unitary transformation of the environment. We have then shown 
that the sharp observables preserved by S are precisely those which commute with 
all the effects (sharp and unsharp) preserved by Sc- It follows that the sharp 
observables preserved by both channels must all commute with each other and 
therefore form a commutative algebra C. Concretely, C is the intersection of the 
algebras A and Ac correctable respectively for S and Sc- C := AO Ac- Since it 
is commutative, it characterizes classical information. This shows that the only 
sharp information which has been duplicated is classical. This can be understood 
as a version of the no-cloning theorem, with the addition that it tells us precisely 
which observable of the system represents the classical information which has been 
duplicated, namely the observable F which generates the commutative algebra C- 
Note that F is defined up to a measure-preserving bijective transformation of its 
spectrum, i.e. the phase-space. 

One possible interpretation is the following. Suppose that S represents the time 
evolution of the system of interest. The algebra Ac represents properties of the 
system prior to the interaction. However, the part of Ac which is also preserved in 
the system, namely C, also represents properties that the system possesses after the 
interaction. Therefore C represents the information gathered by the environment 
and which has predictive power about the future state of the system. In the next 
section we will show that each observable in C in fact implies the existence of 
correlations between the system and the environment after the interaction. 



5.2.1 Correlations 

Consider a "duplicated" sharp discrete observable X with effects Xf = Xi E C- 
Since this observable is preserved by both £ and £ci there exists an observable Y of 
the system and an observable Z of the environment which are such that Xi = S*{Yi), 
Xi = Sci^i) ^- Remember that this means that the information about the 

observable X of the initial state of the system is represented by the observable Y of 
the system after the interaction, and also by the observable Z of the environment 
after the interaction. Hence the piece of information that it represents became 
redundant. We will show that the observables Y and Z are indeed correlated, and 
therefore also contain information about each other. 
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We can "purify" the channel 8 with the isometry V such that 

X, = £*{Yi) = V*iYi(g)l)V (5.1) 

and 

X,=£:{Z,) = V*{10Z,)V. (5.2) 

where V = U{1 (g> I'?/')), with the unitary operator U representing the joint evolu- 
tion of the two interacting systems, and lip) the initial state of the environment 
(see Figure [sT ) . These expressions are just the Heisenberg picture version of the 
relations £(p) = ^TenviU{p0\^){^p\)U*) and Sdp) = Trsj,,(t/(p® | )[/*), where 
Treri,^ is the partial trace over the environment, and Tvsys the partial trace over the 
system. 

Consider the first equation. If we multiply the left hand side by X^ on both 
sides, we obtain 

= X^V*iYi ® 1)VX^. 

Since the right hand side is a positive operator, its square root must also be zero, 
i.e. = {^/Yi ® 1)VX^ or, by multiplying by {^/Yi (S> 1) on the left, and recalling 
that X^^ = 1-Xi, 

(Yi ® 1)V{1 - X,) = 0. (5.3) 

Similarly we have X^- = (1— Xj) = V*{{l — Yi)^l)V, which, by the same argument, 
implies 

((1 - Yi) ® 1)VX, = 0. (5.4) 



Combining Equations 5^ and 5^, we obtain 

(F, ® 1)V = VXi. 



Applying the same reasoning to Equation |5.2| yields 

(1 ® Z,)V = vx,. 

Combing the two yields 

(r, ® z,)v = vx^x^ = 6,,vx,, 

from which we have 

V*{Yi0Zj)V = 6ijXi. (5.5) 

This last equation is the result that we were seeking: it means that Y and Z are 
entirely correlated. Indeed, for any state p of the system, 

TT{VpV*{Y, ® Z^)) = Tr(p\/*(F, ® Z^)V) = 5.,Tr(pX,) 

is the joint probability characterizing the joint measurement of Y and Z after 
the interaction, and, as we can see thank to the Kronecker delta, the probability 
that both measurements yield different outcomes is zero. It is straightforward to 
generalize this analysis to the case of continuous observables. We would then obtain 
that 

V*{Y*{a) ® Z*{p))V = X*{a)X*{P) = X*{aP). 
Hence, for all states p, 

TT{VpV*iY*{a) ® Z*{(3))) = Ti{pX*{af3)). 
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5.2.2 Predictability and objectivity 

These results show that the only sharp information that the environment learns 
about system, and which has any predictive power with respect to the outcome 
of some sharp measurement on the system after the interaction is classical. This 
classical information can be characterized by a single observable F: the pointer 
observable (defined up to a measure-preserving bijective transformation of phase- 
space). Hence, in any interaction, some classical degrees of freedom of a quantum 
system are uniquely selected by the requirement that their information be sent to 
the environment, and at the same time, be useful to predict properties of the system 
after the interaction. From the point of view of an experimentalist who observes 
the system indirectly by gathering information from the environment, the effective 
classical system represented by F evolves deterministically. The other degrees of 
freedom of the system can be simply ignored in this picture. 

We note that our use of the notion of preserved, or correctable, information 
allows for the information represented by the pointer state to have in principle any 
non-trivial evolution during the process of decoherence. Indeed, the fact that the 
information is preserved does not mean that it does not evolve. 

This provides a possible model of the process of decoherence. The interesting 
aspect of this picture is that it does not require any assumption about the interac- 
tion. This makes it universal. Nevertheless this model is limited by the fact that F 
is sharp, and therefore cannot account for most classical limits. The next sections 
will address this problem. 

There is another possible way of interpreting the same technical results. First, 
we note that the two channels that we considered need not be the exact complement 
of each other. Indeed, the results hold if the second channel considered only maps 
the system to a subsystem of the environment. Consider the channel defined by 

J^*{A) := V*{1(^A®1)V 

where the second and third tensor factors correspond to two subsystems of the 



environment. Clearly, by comparing with Equation 5^, we have 

which implies that all the information preserved by J-' is also preserved by Sc- 
Hence it is still true that the sharp observables preserved by both JF and S form a 
commutative algebra. In fact this is true of any couple of channels from the system 
to two different subsystems, be it part of the future state of the system or of the 
environment. 

Given this fact, let us consider two channels J-'i and JF2 from the system to 
two different subsystems of the environment. The information about the system 
preserved by both observables is classical and characterized by a single observable 
F of the system. Therefore, a classical limit is selected simply by the condition 
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that it be represented redundantly in the environment. As stressed by Olhvier et 
al. [ini E] (see also [H]), redundancy of representation is an important aspect 
of classical information, because it guarantees the objectivity of this information. 
Indeed, it allows two different observers to indirectly obtain the same information 
about the system, without disturbing each other's observation in any way. This 
slightly different picture of the process of decoherence has the advantage of allowing 
for the emergence of a classical system subject to noise. Indeed, the requirement 
of redundancy uniquely selects a classical limit without requiring the evolution of 
the effective classical system to be deterministic. This will be analyzed in greater 



details for the more general model introduced in Section 5.5 For now, let us 



just mention that this particular model reproduces the main result of [TU], namely 
the selection of a sharp pointer observable through the requirement that it be 
represented redundantly in the environment. However our notion of redundancy is 
stronger because it does not only involves correlations at a given time, but it also 
requires that this information represents a property that the system possessed at 
an earlier time. 



5.2.3 Examples 

Let us show how this model reduces to the simple example mentioned in the intro- 
duction. We considered the channel £ defined by 

i 

where the vectors \i) form an orthonormal basis of a Hilbert space 7i. The cor- 
rectable algebra A for this channel is the commutant of the algebra generated by the 
operators \i){i\3){j\ = Sij\i){i\. These operators simply generate the commutative 
algebras of operators diagonal in the basis This algebra is its own commutant, 
hence we see that the correctable algebra is already commutative, and characterized 
by the discrete sharp observable F with elements Fj = 

In order to find the complementary channel, note that if V is an isometry 
resulting from the dilation of £, then = (1 {(f>i\)V, for some basis of the 



environment (Equation 3.12). This implies that the elements of the complementary 
channel are 

F, = ((^|®1)1^ = 5^((^|®|0,)(0,|)1^ 

j 

Therefore the sharp information preserved by the complementary channel is char- 
acterized by the commutant the operators F*Fj = \i){4>i\'Pj){j\ = which 
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is identical to the correctable algebra for S. Hence in this case A = Ac, so that 
C = A n Ac = A. This shows that the environment gathers information precisely 
about the observable F which is preserved in the system. In this situation, the 
eigenstates \i) of F are the pointer states usually referred to in the literature [5], [9]. 

For a slightly more general example, consider the channel 

S{p)=Y,UP^pP^U* (5.6) 
i 

for a unitary operator U and an complete family of orthogonal projectors Pi. The 
correctable algebra is given be the commutant of the operators PiU*UPj = PiPj = 
SijPi. This algebra is composed of all matrices which are block-diagonal in terms 
of the subspaces defined by the projectors Pi. Hence 

A^^Mn, (5.7) 

i 

where = Tr(Pjt). If any of the dimension is larger than 1, this algebra contains 
quantum information. 

In order to compute the elements of the complementary channel, we will use a 
basis {I^Apjij of the system which is an aggregate of basis of the complementary 
subspaces defined by the projectors Pi, i.e. Pi = \ipj){'ipj\. The elements of the 
complementary channel then are, for some basis of the environment, 

F^, = E \<Pk){{i^plUPk = (5.8) 

k 

Therefore, the algebra correctable for the complementary channel is the commutant 
of the operators 

i^;.F,, = |^})(0,l0,)(^fl = 5,,l^i.)(V^i|. 

These operators precisely span A. Therefore Ac = A', which implies that the 
intersection of A and Ac is the whole center of A: 

C = AnAc = Z{A) 

which is the commutative algebra of operators of the form 

A = Y,^iPi (5.9) 

n 

for any Aj G C. Hence the sharp information preserved in the system and transmit- 
ted to the environment is characterized by the discrete pointer observable F with 
elements Fj = Pi. 

Note that in this case, the concept of pointer state is not adapted to a de- 
scription of the situation. Instead, we have a phenomenon of environment-induced 
superselection rules (einselection) J52j. Here F is the superselection charge. 
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We see that the unitary evolution characterized by the operator U plays no role 

in this analysis. This is because it does not remove or add any information to the 
system. This demonstrates an important feature of our framework, which is that 
it automatically identifies the original nature of the preserved information while 
neglecting any reversible change in its representation. 

We can also look at how the decoherence gradually sets in, when time is intro- 
duced in the previous example. Consider the Hamiltonian 

H = A0B 

where A := X^ili Pi with Aj = and B generates cyclic shifts in the basis 
i = 0, . . . , of the environment, i.e. 

Note that we will be using the symbol i — -\/— T for notational convenience. The 
eigenstates of B are the discrete Fourier transforms of the basis, namely \j) — 
^ Z^jfc where u — The unitary operator for an interval of time t is 

m 



rt>=0 



i 



n>0 



Note that 



-,-it\iB\ 



3 j 

1 ^e--(u.-.)|^^^ 



N 

jk 

Therefore, if the initial state of the environment is |0), the resulting isometry is 
Vt = Ut{l ® |0o)) = ® e-^*^^^(l ® |0o)) 

i 

i ijk 

The elements of the channel St{p) = Trs(VtpV/), describing the evolution of the 
system, can be chosen to be 



Ek(t) = (1 ® {k\)Vt = E e-'^'''^P^ 



98 



Decoherence 



In particular, for t = = T, we have 

Stip) = Ek{t)pEl{t) = E E e-'-''^'-^^] P.pPj 

k ij k 

= Y,SvP^pPj = Y.^^p^^- 

ij i 



Hence at time T we recover the channel defined in Equation 5^ with U = 1. 

Let us now study what happens before time T. At time t < T, the algebra 
of sharp observables preserved in the system is given by the commutant of the 
operators 



which is the algebra A defined in Equation 5/7 Indeed, every single one of these 
operators for k ^ k' generates the algebra spanned by the projectors Pi, of which A 
is the commutant. To see this it suffices to note that for fixed k ^ k', the coefficients 
^-iui(k-k )tXi g^j^g distinct for all i. This shows that the sharp information preserved in 
the system at all time is the same. However, as we will see, the unsharp preserved 
information gets degraded until only the sharp preserved information is left. 

In order to obtain the elements of the complementary channels, we choose as 
before a basis of the system which is compatible with the projectors Pi, i.e. so 
that Pi = Y2j The elements of the complementary channels then are 

nk 

from which we have 

nn'k 

nn' k 
n 

We want to show that for any time < t < T, these operators generate the whole 
matrix algebra. One can check that the coefficient jn' = e«'^"*(-'*i-^i') jg zero if 
and only if t(Aj — Aj/) is an integer. Since Xi—Xe = ^{i—i'), this cannot happen if t/T 
is irrational. Otherwise, if t/T = p/q for some integers p and q < N , then 7jj/ = 
when i — i' = q. If this happens, we are missing the matrix elements of the form 
in the span of the operators F*-{t)Fiiji{t) for all However, this 

element can be recovered by the product \'^}~'^~^'^) 
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From the previous argument, each element in this sum is always present in our 
span, unless t/T is an integer. Therefore, when t < T, there is no non-trivial sharp 
observable commuting with all the operators of the form F*j{t)Fi/j'{t), and hence 
no non-trivial sharp observable flowing to the environment. 

However there are non-trivial unsharp observables flowing to the environment. 
In fact, we can see how one of these preserved unsharp observable becomes sharper 
with time until it equals one of the projectors Pi. Indeed, let us find an observable 
of the environment that one should measure at time T in order to reproduce the 
statistics of the observable with elements P,. At time T the elements of the com- 



plementary channel are given by Equation 5.8 Using these, one can check that the 



sharp observable with elements Qk '■= \(j)k){(l)k\ does the job. Indeed, 



Measuring the same observable of the environment at an earlier time t yields infor- 
mation about the observable of the system with elements 

A^ = {£t):iQJ = i£t):i\<Prn){<Prr.\) 
ijnkn'k' 
i n j 



where we have introduced 



Tim : Ar2 I XI ^ 



iLonitXi—m) 1 2 



iV2' 

n 

As t ^ T, ■Jim converges to 6im, which implies that Am converges to Pm- 



5.3 Classical set of observables 

The picture presented in the previous section is fundamentally tied to sharp observ- 
ables, and cannot describe the emergence of a classical limit characterized by an 
unsharp observable. In particular this means that it cannot describe the emergence 



of a non-trivial phase-space. In Section 5.5 we will see that this picture can in fact 



be generalized to unsharp pointer observables. However it will be instructive to first 
consider a simpler generalization. For the moment we will ignore the environment, 
and attempt to understand what it means for a channel to destroy any information 
but that represented by an unsharp observable. 
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In the previous section, we used the fact that the set of preserved sharp observ- 
ables form a sub-algebra, and that, when this algebra is commutative, it represents 
an effective classical system. This idea does not work with unsharp observables, 
which are not associated with any algebra. Therefore we need to generalize what 
we mean by a classical set of observables. 

We want to understand when the set of observables preserved by a channel, or 
even any set of observables, can be said to characterize classical information. We 
have already defined classical information as that represented by a physical system 
whose set of effects belong to a commutative algebra. Hence, it is a priori natural 
to think of a classical set of quantum observables as that which is characterized 
by a commutative sub-algebra of the quantum algebra. However, we have seen in 



Section 5.1, that there exists much more general ways for a classical system to be 
represented by a quantum system. In general, the representation can be given by 
a channel mapping quantum states into classical states, or equivalently by its dual 
which translates classical effects into quantum effects. This map, which is simply 
an observable, does not in general preserve the full structure of the commutative 
algebra. Note that when it does, i.e. when it represents a sharp observable, it 
singles out a commutative sub-algebra of the quantum algebra. If however it is 
not a homomorphism, then it may map sharp observables into unsharp ones, and 
commutative pairs of observables into non-commutative ones, which implies a cer- 
tain loss of information. However this loss of information allows for fundamentally 



different quantum representations of a classical system, as seen in Section |5.1 
Consider an observable 

r : Bt{n) ^ L\Q) 

which defines how our quantum system with Hilbert space Ti imperfectly represents 
the classical system with phase-space Q. This maps also represents every observ- 
ables of the classical system by an observable of the quantum system. Indeed, 
consider a classical observable 

TT : L\Q) L\Q') 

which is also simply a stochastic map between the two classical systems Q and Q'. 
This observable is represented by the quantum observable 

X = vror. 

Measuring X amounts to first measuring F, and then measuring tt on the classical 
system Q representing the outcome of the quantum measurement. 



Therefore, to the classical limit T is associated the set of observable of the form 
X = TT o r, for all classical observables tt. Note that this is simply the set Pr of 
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observables preserved by T. 

Bt{n) L\n) 




L\n') 



We will also say that these observables are coarse- grainings of F since they 
amount to measuring F and then forgetting about some aspects of the classical 
result by applying the stochastic map tt. In this sense, each of these observables 
represent less information than F. 

We will take this to be the prototype of a classical set of observable. In fact we 
will say that a set of observable is classical if it belongs to such a set, i.e. if they 
are all coarse-grainings of a single observable F. 

Definition 5.1. We say of a set O of observables that it is classical if there is an 
observable F and a family of stochastic maps vr^, X G such that 

X = nxoT for all X eO. 

This is equivalent to saying that the observables X E O are all preserved by F, i.e. 

O C Pr. 



Note that it would be tempting to characterize this set of observables by a set of 
effects, i.e. the set of coarse-grainings of F which take value in the set VL' = {0, 1}. 



But the counter-example used in Section 3.1 shows that this is not possible 



As an example, consider the set of observables defined by a commutative sub- 
algebra A C Bt{T-C). In this case F can be chosen to be any sharp observable whose 
spectral projectors generate the commutative sub-algebra A. More concisely, F can 
be represented by a self-adjoint operator F, whose bounded functions generate A. If 
F takes value in Qr, i-e. T : A^ ^ L^{Qr), then the dual map F* is a homomorphism 
of L°°{Qr) onto A. This means than a sharp observable X represented by a self- 
adjoint operator X G ^ is a coarse-graining of F in the following way. We know that 
there is a real function / such that X = /(F). The function / is a mapping from 
the spectrum of T to that of X. This means that the effects X*{xuj) of X satisfy 
X*{a) = T*{ao f). We then define the stochastic map vr by 7r*(a)(x) := for 
all X G fir, so that we have X*{a) = F*(ao/) = F*(7r*(a)) for all a, i.e. X = vroF. 

For instance, if F is sharp and discrete with elements F^ = Fj, then all the 
operators in the algebra F that it generates are represented by self-adjoint operators 
of the form X = Yli(^nr ^i^i- then clear that measuring F is as good as 

measuring the observable represented by X. All that one has to do to simulate a 
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measurement of X is to forget the distinction between the outcomes i and j in the 
case that Aj = A^, an operation which can be represented by the stochastic matrix 
71 defined as follow. Let fl be the set of distinct values taken by the coefficients Aj 
for all i, which is the spectrum of X. The stochastic matrix vr is defined by nix^ = 1 
for all i E fir and for all other components. 

5.3.1 Non-contextuality 

Our definition of a classical set of observable is further justified by that fact that 
they represent an effective theory which can be simulated by a non-contextual 
hidden variable model, in the sense defined in [53] . 

Suppose that we have a physical system defined by a set of observables O 
and a set of states §. A non-contextual hidden variable model associates to each 
observable X E O a. classical observable hx, and, to each state p G S, a classical 
states /ip, in such a way that, when combined, these classical states and observables 
yield the same probability distributions as the quantum states and observables they 
represent. 

The idea is that the model would be contextual with respect to preparations if 
the classical state was a function not only of p but also of the observable X, which 
would mean that the particular state we must choose depends on the particular 
experiment that we are going to perform. Correspondingly, it would be contextual 
with respect to measurements if the classical observable to choose depended on the 
quantum state. 

It is known that quantum theory cannot be given a non-contextual model. How- 
ever, a classical set of quantum observables can. Indeed, this is precisely what the 
observable F which characterizes our classical set does. Indeed, it maps any quan- 
tum state p to a probability distribution fip = T{p) G L^{Q). In addition, each 
observable X of the classical set is associated with a classical observable ttx which 
is such that X = ttx o F. This relation implies that the probability distribution 
X{p) is equal to one given by the effective classical theory, i.e. nxiPp)- Indeed, 

7rx(^p) = 7rx(F(p)) = X{p). 

5.3.2 Functional coexistence 

There exists a concept of coexistence between POVMs [54j which generalizes the 
notion of commutativity for PVMs. 

Remember that an observable X defines a map X* from classical effects to 
quantum effects. When this map is restricted to sharp classical effects, which are 
all of the form Xuj for some subset u C Q of the phase-space Q, then it defines a 
POVM, which is a map from subsets u to quantum effects. The effects of the form 
X*{xuj) can therefore be said to be in the range of the POVM. A set of POVMs 
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is called coexistent if the union of their range are all within the range of a single 
POVM. To show that the observables in our "classical" set are coexistent, we first 
need to prove a Lemma. We say that an observable X is a marginal of 

Y : Bt(n) L\ni) (g) L\n2) = x Q2) 

if X*{a) = Y*{a ® 1), which means that measuring X amounts to measuring Y 
and then discarding the Q2 component of the result. The following then is true: 

Lemma 5.1. Observables in a classical set are all marginals of a single observable. 

Proof. Suppose that we have a set of observables O, which are all coarse-grainings 
of r : B(Ti.) L^{Q). For clarity, we will label the observables in O by a set S, 
i.e. O — {X„ I a e S}. For all cr e E there exists a channel tTo- : L^{Q„), 
where Qa is the set of values of X^, such that X^- — TTaoT. Consider the channel 

copy : L\n) ^ (^L\n) = L^(J]n) 

E S 

defined by copy(/x) = fx. This channel produces as many copies of the state of 
the classical system Q as there are elements in the set E. We use this to define the 
new observable ^ 

r ((^ Tia) o copy o F 
o-es 

which amounts to measuring F, then duplicating the output as many times as there 
are elements in E, and finally applying tTo- to the cr's copy. It is thenjilear that 
each observable X^ is a marginal of F, namely it amounts to measuring F and then 
discarding all but the cr's component of the outcome. □ 

This almost directly shows that all coarse-grainings of a single observable F 
are coexistent. Indeed, using the same objects as in the above proof, we have 
-^aiXoj) — r*(x2;), where uj contains all the elements of Yij: ^ which have their cr's 
component inside cu. 

In fact, this shows that the coarse-grainings of F are more than just coexistent. 
They are also functionally coexistent, which means that cu is related to a; by a 
function / : Hs ^ ~^ ~ /~^(^)- Here the function is f{x) — x^, and 

/^^(cj) denotes the pre-image of uj under /. In fact, it is also clearly true that 
functionally coexistent observables form a classical set. Therefore we have the 
following: 

Proposition 5.2. Observables form a classical set if and only if they are function- 
ally coexistent. 

5.4 Decoherent channels 

Equipped with our definition of a classical set of observables, we can now define a 
fully decoherent channel S as one whose preserved set of observables P£: is classical. 
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CPr 



for some observable F. The observable F then is the pointer observable character- 
izing the decoherence process. 

Definition 5.2. A channel S is said to be fully decoherent in terms of the pointer 
observable F if all its preserved observables are coarse-grainings of F. 

It may be convenient to visualize this definition by combining the commutative 
diagrams which state that X is preserved by both £ and F: 



This can be read as follows. Imagine that the channel 8 and the classical limit 
F are given, and disregard the diagonal X. The condition for F to be a pointer 
observable selected by E is that for every observable Y, n exists to complete the 
square. 

In general, if P^- 7^ Pr, the pointer observable F is not unique. For instance, it 
could be that the channel preserves no information at all, which would mean that 
the preserved observables are only those whose effects are proportional to the iden- 
tity. In this case, absohitcly any observable could serve as the pointer observable, 
according to the above definition. The idea is that in this case any observable would 
yield a consistent classical interpretation of this information, albeit admitting only 
maximally mixed quantum states. In general, the description based on a pointer 
observable F contains more information than what has actually been preserved by 
the channel. Therefore, the arbitrariness in the choice of the pointer observable cor- 
responds to the arbitrariness of the added information in the classical description. 
In fact we may expect that, in reahstic systems, the differences between various 
choices of the pointer observable vanish when the accuracy with which the observer 
can resolve the classical observables is small compared to h. 

5.4.1 Example: entanglement-breaking channels 

As an example, consider all channels with rank-one elements, i.e. of the form 




TT 
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where the states and are arbitrary, apart from the condition that S*{1) = 1, 
which imphes that 

5^A#,)(0,| = 1. 

i 

This is the general form of the entanglement-breaking channels in finite-dimensions 
[55]. Channels of this form are fully decoherent, with discrete pointer observable F 
defined by its elements 

Ti = Xi\(f)i){(f)i\. 
Indeed, an observable X preserved by S has the form 

X*{a) = Y,\\<P^){'^i\Y*{aM){<P^\ 
i 

for some observable Y. Therefore, 

i 

where the classical channel tt is defined by 

7r*{a), = {ij,\Y*{aM). 

To see that this map tt is a valid channel, note first that it is manifestly positive 
and unital. In addition, it is the dual of the map 

7r(/x), = r(5^/x,|^,)(^,|) 

i 

on states, and is therefore normal. Note that such channels were already studied in 



Section 3.4 In particular, the sets of effects represented in Figure 3.2 are examples 



of classical sets. 

If the sum is replaced by an integral over a continuous set of states, the resulting 
channel has the same properties. An example of such a channel is given in Section 
[5A31 below. 



5.4.2 Covariant decoherent channels 

The above considerations about the freedom in the choice of the pointer observable 
r can be made more concrete in the case where the channel S is covariant with 
respect to a unitary representation U : g E G ^ Ug of a compact Lie group G. The 
channel S is said to be covariant with respect to this group if there exists another 
unitary representation V : g ^ Vg which is such that, for all g ^ G, 



EiUgpu;) = Vg£{p)v;. 
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Now suppose that this channel is fully decoherent, in the above sense, in terms 
of some pointer observable 

To : BtiU) ^ L\n) 

This means that for all observables X there exists a classical observable ttx such 
that for all effects a, 

but then we also have, for all g & G, 

£*iX*ia)) = £*iV;VgX*ia)V;Vg) = U;£*iVgX*ia)V;)Ug 

= u;T*in;Aa))Ug 

where, by gX we mean the observable defined by 

(gXyia) ■.= VgX*ia)V: 



Equation 5.10 means that the preserved observables are not only coarse-grainings 



of r, but are also coarse-grainings of any observable of the form 

r*(a) := U;Tl{a)Ug. 

This shows an example of ambiguity in the choice of the pointer observable. How- 
ever, it may be that the pointer observable F is such that Tg is equivalent to F up 
to a relabelling of its values Q. This would mean that F itself is covariant with 
respect to G in the sense that there exists an action A of G on f2 which is such 
that Tg{a) = Fo(a o A^). In fact we can always build a pointer observable F which 
is covariant in this way. It is, in a sense, and average of each F^ over the Haar 
measure. More precisely, we define F by 

r*(a) := J Tl{ag)dg = J U;Tl{ag)Ug dg. 

where a G L°°{G) L'^{fl), (yg{x) := a{g,x) for all x E fl. The invariant Haar 
measure dg on G is normalized such that J dg = 1. This observable is covariant for 
the group G with respect to the action X on G x Q defined by the right-action 

Xhig,x) := {gh-\x) 

for all h E G and x E Q. Indeed, we have 

U:ria)U, = I U;,Tliag)Ug,dg = J U;Tliag,-^)Ug dg 
= ju;ri{agoXf,)Ugdg = T{aoXj,). 

We still have to show that, indeed, all observables preserved by S are coarse- 
grainings of F, so that it can serve as a pointer observable. Given an observable X 
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taking value in the set fix, 

£*{X*{a)) = I £*{X*{a))dg 



u;rin;M)u,dg 
= r(ni(a)) 

where we have defined the classical observable 
by 

U*x{a){g,x) := 7r*^(a)(x). 
5.4.3 Example: coherent states 

Let us give an example of a channel decoherent in terms of an unsharp pointer 
observable. We will use the setting of a Hilbert space Ti. which contains coherent 



states l^;) = \q,p) generated from a fiducial vector \ipo), as in Section 5.1.1 We 
consider the channel S defined by 

S{p) = / \z){z\p\z){z\dz (5.11) 



or 



£*(A) = / \z){z\A\z){z\dz. 



This channel is of the form of the class of examples studied in Section 3^, apart 
from the fact that the sum has been replaced by an integral. An observable X 
which is preserved by this channel is of the form 



X*{a) = j \z){z\Y*{a)\z){z\dz. 
for some observable Y. It is immediately clear that this observable is a coarse- 



graining of the coherent-state observable T studied in Section |5.1.1[ Indeed, re- 
member that 



r*(a) = / a{z)\z) {z\dz. 

Hence, we have 



X*{a) = J {z\Y*{a)\z)\z){z\dz = J {TT*{a)){z) \z){z\dz 
= r(7r*(a)). 
where we have defined the stochastic map tt by 

7r*{a){z) = {z\Y*{a)\z). 
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Note that vr* is positive and unital. In addition it is normal, because it is the dual 
of 

ni^^)=Yi[ f^iz)\z){z\dz) 



which is well-defined given that p := f p{z)\z){z\dz is trace-class. Indeed, for a 
basis we have X]j(^lpK) — I f^i^) = 1. 

Therefore we have in this example that P^- C Pp. However, we do not have 
equality between these sets. Indeed, T itself is not preserved by the channel. In- 
stead, the closest we can get is an observable F of the form T ■.= r o S which, more 
explicitly, has the form 



r.(„).rrw)^yaW|.>(.|.'>(.1.>(.M.<*.' 

a{z) \ {z\z')\'^\z){z\ dzdz'. 



This observable can be understood as an approximate form of the observable F. 
For instance, if the fiducial state IV^o) that we used to define the coherent states is 
a Gaussian then so is which shows that if a{z) is peaked around z, then 

F*(a) is given by a Gaussian smearing of coherent states around z, instead of the 
coherent state \z){z\ itself. However, F itself is not sufficient to characterize P^-. 
Indeed, Pp does not contain the whole of Pf. 

Note that this channel S is clearly covariant with respect to the group of trans- 
lations and boosts in phase-space, which was used to generate the coherent states. 
However, we cannot use this fact to generate other pointer states from F, given 
that F is already covariant with respect to this group. 

In conclusion, the channel defined by Equation 5. 11 [ yields an example of how a 



realistic classical limit can emerge dynamically from decoherence. We note that the 
emergence of phase-space through decoherence has been studied before [15], |9] , how- 
ever this is the first model which allows in principle for the possibility of a derivation 
of the structure of the emergent phase-space. Indeed, all other approaches attempt 
to show that a given classical description is consistent, without excluding other 
possible classical descriptions. Although our description may also have a certain 
ambiguity in the choice of the pointer observable F, the main properties of the 
classical description should come from the features encoded in the well-defined set 

Another advantage of this picture is that it enables us to study the transfer of 
information to the environment, as we will show in the next section. 



5.5 Decoherence from broadcasting 



In Section 3.3, we showed that sharp preserved observables which are duplicated 
twice must form a classical set. We exploited this fact in Section 5.2 in order 
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to provide a model of decoherence in terms of a sharp pointer observable. In 
the previous section we familiarized ourselves with the idea of an unsharp pointer 



observable. We will now try to generalize the result of Section |5.2| to unsharp 
observables in order to have a full-fledged picture of decoherence. However, we will 
see that in the case of unsharp observables, it is not sufficient to duplicate them 
twice in order to make sure that they form a classical set. A counter example will 



be provided in Section |5.5.2[ Instead, we will show that the unsharp observables 
which are duplicated an arbitrary number of times must form a classical set. Our 
proof however will assume that we are dealing with a finite-dimensional quantum 
system. 

We consider a channel S from the system with Hilbert space Ha to an infinite 
number of subsystems Hbj^,'Hb2,'Hbs, ■ ■ ■ of the environment. 

£:BtinA)^i3tinB,®nB,(S)...) (5.12) 

If needed, the infinite tensor product Hilbert space can be made separable by keep- 
ing only basis elements with a finite number of factors different from a special state 
selected in each Tf^. [56]. The flow of information from A to the ith system Bi is 
described by the channel 

Si : Bt{HA) BtiHB^ 

deflned by 

£iip)=Tr'Ap) (5.13) 

where Tr'^ is the partial trace over all subsystems but the ith. We want to charac- 
terize the information which is preserved by all these channels at the same time. 
This information is encoded in the set of observables which are preserved by all the 
channels: 

i 

We will say that the observables in this set are broadcast. We can show that any 
countable set {Xi} C X is classical, in the sense that all the observables in it 



are coarse-grainings of a single observable (see Section 5.3). Indeed, let Yi be the 
observable such that Xi = YiO Si, which exists thank to the assumption that Xi is 
preserved by Si. Then consider the observable 

T ■.= {Yi0Y20...)oS. 

It is clear that all the observables Xi are marginals of F. Indeed, measuring F 
and then discarding all classical output subsystems but the ith is equivalent to 
measuring YioSi = Xi. This proves that {Xi} C Pr- Note that any coarse-graining 
or convex combination of these observables Xi is also in Pp. 

If the set X was separable with respect to a certain topology T, then we would 
have that {X^} can be chosen to be dense in X, so that 

X C P^ 
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where the horizontal bar denotes closure with respect to the topology T. This would 
show that a single observable F suffices to simulate the broadcast observables in X 
to arbitrary precision (with respect to T), and therefore that the set I is classical, 
and characterized by the pointer observable F, in this somewhat more general sense. 

The set I can be made separable under the following assumptions 

1. It includes only observable taking value in a single set Q. For instance, one 
could pick the disjoint union i7 = M U N in order to include both continuous 
and discrete observables. 

2. The source Hilbert space (on which the observables in I are defined), is 
finite-dimensional . 



The finite dimensionality of Ti.A in fact guarantees that the whole set of observables 
of the form X : Bt{li.A) — * L^{VL) is separable. Indeed, consider a basis cTj of BtiTi). 
Each observable X is entirely characterized by its components X{ai) G L}{VL). 
Given that there is a finite number of them, and that L^{VL) itself is separable, the 
set of all these observables is also separable. The topology T which is involved here 
is defined by the metric 

rf(X,F) =max||X(ai)-r(a,)||i (5.14) 

i 

where || ■ ||i is the norm in L^(f2). 
This proves the following: 



Theorem 5.3. Let £ he a channel of the form specified by Equation 5.12 , with the 
source Hilbert space Ha finite- dimensional. Then the set Iq of observables with 
values in a given measure space Q, which are preserved by all the partial channels 
Si, can be approximated arbitrarily well by coarse- grainings of a single observable 



T, in terms of the distance defined by Equation 5.14 



We will look at the consequences of this result in Section 5.6 But ffist, let us 



examine the question of the correlations induced by unsharp preserved observables. 



5.5.1 Correlations 



In Section |5.2.1| we showed that a sharp observable which is duplicated twice im- 
plies the existence of exact correlations between the two destination systems. For 
unsharp observables, the correlation will in general not be exact. However, there 
are cases where the correlation is exact even for genuinely unsharp observables. To 
show this, let us give a slightly more general form of the result derived in Section 

Consider a discrete observable X with effects Xj. Suppose that for each i, Pj is 
a projector satisfying 

P^XP^ = 0. 
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We will assume that this observable X is preserved by two channels Si and 82 
which are given by two different partial traces of the same channel S. For instance 
they could be any pairs of channel Si defined in Equation 5.13 Hence, there exists 
observables Y and Z which are such that Xi = Sl{Yi), Xi = E^iZi) for all i. 

We have 

X, = r(r,® 1) =r(l®Z,) (5.15) 
If we multiply by on both sides in each term, we obtain 

= Pte^iji ® \)Pt = ® Zi)p^^. 

If Ek are the elements of the channel S, then using Lemma [3l2 we get 

{Yi ® l)Ek = (1 ® Zi)Ek = EkPi 
Together, these two equations yield 

(F, ® Zj)Ek = (F. ® Zj)Ek{P^P,r 

where n is an arbitrary integer. In the limit for n — > 00, we get a new projector 
Pij on the right-hand side which projects on the intersection of the subspaces cor- 
responding to Pi and Pj. Hence, if our observable X is such that Pi and Pj project 
on subspaces with trivial intersections, then 

(Y, ® Zj)Ek = (F. ® Z,)Ek6ijPi 

which implies that whenever i ^ j, 

S*iY,(E)Z,) =0. 

Hence Y and Z are fully correlated. Note that we had to assume that the supports 
of the effects Xi all have trivial intersections. This happens for instance if Xi oc 
\ipi){ilji\ and the states \ipi) are all distinct (but not necessarily orthogonal). 

More generally, we do not expect that a duplicated unsharp observable will 
always induce exact correlations between the target systems. However, we expect 
that the amount of correlations between Y and Z is related to the information 



content of the observable X, as defined in Section 2.1.3 



5.5.2 Non-classical two-fold broadcasting 

We showed that sharp observables must form a classical set when duplicated twice. 
For unsharp observables however, we showed that they form a classical set under 



the assumption of an infinite quantity of copies (Theorem 5.3). Can we arrive at 
the same conclusion under a weaker assumption, in the unsharp case? Here we 
give an example which shows that, unlike for the sharp case, two copies are not 
sufficientEl 

^This example was suggested to the author by A. Winter. 
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Consider the isometry V which embeds a three dimensional Hilbert space H 
into the antisymmetric subspace of the tensor product (g) as 

ijk ^ 

where eijk is the totally antisymmetric tensor (in three dimensions), and i G 
0, 1,2 arc an orthonormal basis of Ti, and := \i) (E) If we trace out one of 
the space Ti after the action of the isometry, we obtain a channel whose dual is 

S*{A)^V*{A(»1)V 

^ \ ^ (^ijk(^t'j'k'\k){ij\{A^l)\i'j'){k'\ 

ijki'j'k' 

= ^ 5Z ^ijkei'jk'\k){i\A\i'){k'\ 

ijki'k' 

= ^ ^{Sii'Skk' - Sik'Ski')\k){i\A\i'){k'\ 

iki'k' 

-lJ2\^){m){k\-\k)mk){z\ 

ik 

= i(Tr(A)l-A^) 

where is the transpose of A seen as a matrix with respect to the basis By 
construction, it is clear that this map is completely positive and unital, and that 

since tracing-out the other destination subsystem yields exactly the same channel. 
Therefore, the whole set of observables preserved by S is also preserved by Sc and 
is therefore duplicated. 

Let us suppose that the set of preserved observable is classical, i.e. that there 
exists an observable T such that all preserved observables are coarse-grainings of T. 
We want to reach a contradiction. 

First, let us show that the channel £ is covariant for the whole unitary group 
U{Tt). Indeed, for any unitary U on H we have 

S*{U*AU) = J(Tr(C/*^C/)l - {U*AUf) 
= -(Tr(^)l - U^A^TJ) 

= -(Tr(>l)l - U^A^U) 

^U*£*{A)U 
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where U is the complex conjugate of U, which is also a unitary operator. In fact, 
g i-^ Ug is also a representation of the group. 



As seen in Section 5.4.2 this implies that there is a covariant pointer observable 
r, defined by 

T*{a) = j U*T*{au)UdU. 

where ajj{x) := a(f/, x) for all U E UilH.) and x G fi, the set in which V takes value, 
this observable F is covariant in terms of the representation A on UiTi) x Q defined 

by 

XuiV^x) = iVU*,x) 

for any unitary operators U, V and any x E Q. 

In fact we can build a covariant pointer observable which is more generic that F. 
First, note that each matrix element (i|F*(a)|j) defines a normal linear map from 
L°^(f2) to C, i.e. it can be associated with an element of the pre-dual L^{fl) 
such that 

{i\T*{a)\j) = / fiij{U, x) a{U, x) dx. 
Jq 

For each x we can build the matrix 



Tx := '^fiij{x)\i){j\. 



We then have 



r*(a) 



a(x)Tx dx 



The operators F^^ are the continuous equivalent of the elements of a discrete POVM. 
If we diagonalize these operators we obtain F^. = Af |i) (zj^,. where A^ is the 
eigenvalue of F^. with eigenstate We pick unitary operators which are such 
that 1^)2; = U^\0), where |0) is some fixed arbitrary state. 

Recall that given our assumptions, for each observable X there is a stochastic 
map TT such that £*{X*{a)) = F (7r*(a;)). Expanding the right-hand side of this 
equation, we obtain 

£*{X*{a)) = [ U*T*{{n*{a))u)UdU. 



{Tr*{a)){U,x)U*T^UdUdx. 

(tt* (a)) (f/, x) J2 >'^U*U^\0) (0 1 {U^yUdUdx 

i 

J2i^*i(^))iUiV,x)X^V*\0){0\VdVdx 



J2 f{7r*{a)mV,x)XUx 

i 

A*(n*(a)) 



V*\0){0\VdV 
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where we have defined the observable 

A : Bin) L\U{n)) 

by 

A*(a) ■='i j a{U)U*\Q){Q\U dU 
and the stochastic map (or classical observable) 

n : L\u{n)) ^ L\nx) 

by 

Since all preserved observable must be coarse-grainings of A, then in particular, 
we must have that 

£*{\i){i\) = 1(1 - \i){t\) = 3 [ a{U) U*\()){Q\U dU. (5.16) 



2 

for some effect a in L°°(U{H)). But this is not possible. Indeed, we would have 



a{U) {i\U*\0){0\U\i) dU = 
which implies that for all U, either a{U) = or {0\U\i) = 0. Hence the integral 



in Equation 5.16 is supported on the set of unitary operators U which are such 
that {0\U\i) = 0. But this set is of measure zero. This means that we must have 
— 0) which contradicts the hypothesis. Therefore the set of preserved 
observables in this example is not classical, even though it has been duplicated 
twice. 



5.5.3 Example: symmetric broadcasting 

We consider the optimal fully symmetric quantum cloning machine introduced in 
|57] . This is a channel which approximately copies n to m qubits without discrim- 
ination in favour of any state, and with optimal fidelity. In the case n = 1, each 
individual channel SjT'^ resulting from tracing out all destinations qubits but one, 
is of the form ^ 

^l"'\p) = "mP + (1 - "m) Tr(p)-1. 

In the limit of an infinite number of copies (m oo), the parameter a tends to 
« = 

Skip) :=4°°Hp) = ^P+^Tr(p)^l. 
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Since these channels are identical for every k, the information preserved by one of 
them is also preserved by all the other channels. Therefore I = P^-^. for any k. Note 
that is invertible. This means that we can apply Proposition 3.1, which states 



that the observables preserved by Sk are those whose effects are all preserved. 

We will show that the preserved effects Sl{E(H)) are all coarse-grainings of 
any of the symmetric informationally complete (SIC) POVMs [SB]. A POVM is 
said to be informationally complete if its elements Xj span the space of density 
operators, which implies that its statistics entirely determines the state. Indeed, 
the component of a state p associated with the basis element Xi is given by the scalar 
product Tr(pXj) which is also the probability for the ith outcome in a measurement 
of X. 

For a qubit, such a POVM must have at least four elements. Indeed, the Hermic- 
ity condition on states selects a four dimensional subspace of the space of matrices. 
Note that the condition that the trace be unity, although it does reduce the di- 
mension of the manifold of states to three, does not do so in a linear fashion. An 
informationally complete POVM is further said to be symmetrical if the scalar 
product Tr(XjXj) is the same for all pairs of distinct POVM elements. In addition, 
one usually assumes that the elements Xj are of rank one. 

For a qubit, a SIC-POVM F has elements {Fq, Fi, F2, F3} which are proportional 
to projectors onto four pure states \ipi). If we represent these states in the Bloch 
sphere, i.e. in terms of their traceless components A* = Tr((|'?/'i) — ^l)aj) where 
aj, j = 1, . . . , 3 are the Pauli operators, these states correspond to the vertices of 
a regular tetrahedron inscribed in the Bloch sphere. 

The set of effects preserved by F is 



F*(E(n)) = {^a,Fi|0<a, <1}. 



where Q = {0, 1,2,3}. Let us picture this convex set in the space of Hermitian 
matrices spanned by the Pauli basis {1, o"i, cr2, as}. Note first that the boundary of 
the set E(7Y) of all the quantum effects is a double cone with tips at and 1. The 
intersection of the two cones is the Bloch sphere in the 3-sub-manifold of trace-one 
operators (i.e. with component ^ in the direction specified by the identity 1). The 
subset F*(E(f2)) is made of two pyramids whose tips are also and 1 and whose 
edges are on the surface of the cones and intersect the Bloch sphere where the pure 
states Itpi) are. (This is the four-dimensional version of the n = 3 case represented 



in Figure 3.2 ) 



This set contains the set £^{K{TC)) of effects preserved by S^.: 

Sl{E{n)) C F*(E(r])). 

Indeed, the action of the channel S^. consists in reducing the radius of the two cones 
representing E(7^) by a factor one-third, which creates two smaller cones which are 
precisely inscribed into the two pyramids defining F*(E(f2)). This fact follows from 
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knowing that the largest sphere inscribed in a regular tetrahedron (which is the 
shape of the base of these four-dimensional pyramids) has radius equal to one-third 
the distance from the center to any of the vertices. 

Hence all the effects preserved by S are also coarse-grainings of F. Since the 
operators Fj are linearly independent, F* is invertible, which implies that any ob- 
servable made of effects preserved by F is also preserved by F. Hence, any observable 
which is preserved by the channel £k has its effects in £l(E{H)) C F*(E(f2)) and is 
therefore a coarse-graining of F. 

This shows that for this example, 

for any SIC-POVM F. Note that all the SIC-POVMs are given by applying an 
arbitrary unitary transformation to any given one. In this example, the channel S 
is covariant with respect to the whole unitary group. Indeed, 

£{UpU*) = ^-UpU* + hr{p)^UU* = U£{p)U*. 

This is the reason why any unitary transformation of our pointer-observable F is 
also a pointer observable. 

5.5.4 Example: iterated interactions 

Consider an interaction defined by the isometry V : H ^ H ^He, where He de- 
notes a subsystem of the environment, and 7i is finite-dimensional. This interaction 
can be iterated in a way which adds a new subsystem of the environment at each 
iteration. For instance, the third iteration is given by the isometry 

y{3) ■.n^n®nE,®nE^® Hes 

where Hes —'He^ —'Hei =Ti-E, which is defined by 

An example would be a system colliding with different particles, one after the other. 
The system He^ corresponds to the nth particle it has interacted with. 

The channel S describing the evolution of the system during each interaction is 
given by 

£*{A) :^V*{A^1e)V 

and has for complement 

S*{B) := V*{10B)V. 
These two channel can be used to describe the channel 
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from the initial state of the system to the final state of the kth particle of the 
environment: 

where, by S'^~^ we mean k — 1 iterations of the channel S. It is clear from the 
definition that the observables preserved by are also preserved by S"^ whenever 
m < n. Therefore, the observables preserved by £rn are also preserved by £n 
whenever m < n. This implies that the observables X preserved by all the channels 
Sn, k < n are also preserved by Sk = £^~^. What is 1 in the limit k — > oo? 

Let us assume that Ti is finite-dimensional. Then we know [39] that there is an 
increasing sequence of integers {/cj}^;^ C M which is such that in the limit 2 — *• oo, 
[E^^Y tends to a projective unital completely positive map V which projects on the 
fixed point set Fix 6^*. This shows that the observables preserved by the channels 
E'^ for all /c G N are those which are fixed by E^ i.e. such that 

X = XoE. 

Note that this is a case where the preserved observables are entirely characterized 
by the preserved effects. 

An effect A which is preserved by all the channels /c G N must be such that 

A = v{E:m 

for some effect B. This implies in particular that A G Fix£^*. But what can we 
deduce about E*{B)1 

For simplicity, let us suppose that E is unital, i.e. E{1) = 1. Then we know 
[59], that Fix£^* is a *-algebra, which we will call A. Clearly it is also trivially a 



correctable algebra for E. Therefore we know from Corollary 3.8 that all the effects 
in E*{K{TCe)) must belong to the commutant A': 

E:{E{nE))cA'. 



We have seen in Section 4.3.1 that the center Z{A) is characterized by a com- 
plete orthogonal family of subspace Tik, which we will associate with isometrics Vk 
and projectors Pk = VkV^. A generic element A G ^ is of the form 



A = j2yk{Ak®ik)v,* 

k 



where Ak^lk is an operator on Hk- Similarly, an element B G E*{E{Ti.E)) ^ A' is 
of the form 

B = j2yk{ik^Bk)v:. 

k 

In addition, since V fixes A, its channel elements Ei must all belong to the 
commutant A' [60]. This can be seen by studying its effects on projectors inside 
A, and using Lemma |3.2[ This implies that it is of the form 

ViA) = J2 Vkilk ® iEir)V:AV,il, ® E})V* 

ijk 
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Hence we can deduce that an effect A preserved by all channels Sk is of the form 

A = v{B) = Vkilk ® {Eimu ® Bk){ik ® Ei)v: 

ik 

= j2v,{U0{EirB,Ei)v: 

ik 

where we assumed that B G S*{K{He)) C A'. This shows that A G A'. But we 
also know that A E A since it is in the image of V. Hence A E An A' = Z{A). 

Therefore, in the case where £ is unital, the observables in X = f]^ P^-^. all belong 
to a commutative algebra: the center of the algebra A = Fix^^*. 



5.6 Discussion 



Let us summarize the picture of decoherence suggested by Theorem |5.3[ We con- 
sidered a quantum system undergoing an open evolution, i.e. interacting with its 
environment. Nothing forces the information preserved in the system to be clas- 
sical, unless the evolution is a "fully decoherent" channel of the form studied in 



Section 5^ However, we have seen that the information which is broadcast redun- 
dantly to the environment is characterized by a classical set of observables T C Pp. 
This process is the physical realization of a measurement of the pointer observable 
r on the system by the environment. The redundancy guarantees the objectivity 
of the information stored in the environment, as argued in [TU]. Furthermore, if 
this information is also preserved in the system itself during the interaction, then 
the information contained in the environment is correlated with that contained in 
the final state of the system (even if it evolved). Therefore this information has 
predictive power and can characterize the state of a deterministic effective classical 
model of the system. 

In this picture, the pointer observable can be defined even if it does not represent 
information which is preserved in the system. In this case, the emergent classical 
degrees of freedom represent a classical system subject to noise. Alternatively, the 
process of decoherence in this situation can be seen as a destructive or partially 
destructive measurement. 

Note that if T = Pr this process represents an exact measurement of F by 
the environment, whereas X C Pp implies that the environment does not gain full 
information about F. In this case, F represents only an approximate description of 
the information present in the environment. This also means that there exists an 
ambiguity in the pointer observable due to the arbitrariness of the extra information 
contained in Pp. 

Finally, we observed that even if the measurement of F by the environment is 
complete, the corresponding classical limit may have some forbidden states. In- 
deed, if F represents an approximate simultaneous measurement of non-commuting 
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observables, the classical states in its image are constrained to respect the corre- 
sponding uncertainty relations. However we expect these quantum constraints on 
the effective classical states to be irrelevant for phase-space observations coarser 
than h. 



5.6.1 Universality of classicality 

The main advantage of this picture is that it relies on very little physical assump- 
tions. For instance, no particular model of interaction has been assumed. We 
did however, make one rather unusual assumption which requires some comments, 
namely the idea that only redundant information matters at macroscopic scales. 
This idea seems intuitively reasonable as redundancy of information evokes the 
amplification of a signal. In addition, we cannot think of any classical degree of 
freedom which is not represented redundantly in a way or another, given that large 
systems are always made of many particles which share the essential properties of 
the object. As an example, we can think of the position of a rigid body. Certainly, 
knowing the position of any of its molecule would suffice to localize it. 

In fact, since what it means to be macroscopic has never been defined unam- 
biguously, it is tempting to think of this assumption as a mere definition. Indeed, 
we could say that macroscopic physics is that which deals with highly redundant 
information. In this view, that which is "macro-" in "macroscopic", i.e. large, is 
the number of equivalent representations of the same information. If we accept this 
postulate, then Theorem |5.3 shows that macroscopic information is always classi- 



cal, hence making the universality of classicality at macroscopic scales a tautology. 
A problem with this view is that the our notion of redundancy relies entirely on a 
preferred decomposition of the environment into subsystems. For our interpretation 
to make sense, the decomposition should be such that a classical observer is only 
able to measure observables which are local with respect to this decomposition. 

This view stands in contrast to the standard approach on this question, which 
consists in the study of realistic models under a class of assumptions as broad as 
possible, with the aim of showing that they are indeed fully decoherent, in the 
sense that their evolution destroys all but classical information [13]. However such 
studies have a point: decoherence is omnipresent even in microscopic experiments, 
which is why building a quantum computer is so difficult. 

These two views may be reconciled if we consider that quantum experiments 
are about the control of a microscopic system by a macroscopic one, which is 
by definition subject to decoherence. Macroscopic systems are those which are 
extremely efficient at spreading information. It is therefore no surprise that when 
put in contact with quantum systems, classical instruments cause a decoherence 
which may otherwise be absent. 
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5.6.2 Outlook: dynamics and constraints 

To conclude, we will say a few words about the possibility of integrating dynamics 
into our model of decoherence, which is an open question. 



Let us consider the "broadcasting" channel £ defined by Equation 5.12 As 



already explained in the beginning of Section 5.6, in order to obtain a dynamical 
effective classical theory, we need to assume that the information represented by 
the set X is also preserved in the system. We will suppose that the evolution of the 
system itself is given by the channel Sa- This channel represents only the relation 
between two specific moments in time. However, if S and Sa are obtained from a 
unitary interaction generated by a Hamiltonian, they naturally depend on the time 
at which we decide to trace out the system or the environment respectively. In 
fact, we need not trace-out both at the same time. For instance, we could use the 
channel S as it is at asymptotic times, assuming that the environment permanently 
keeps a record of the relevant information about the system. 

Recall that the channel S to the environment selects a classical model charac- 
terized by the set of observables X, or its approximate representation given by the 
pointer observable F which is such that Pr ^ X. We suppose that, in addition, 
SA{t) preserves I for all time t. This means that for any observable X G X, there 
exists a family of observables Y{t) which are such that X = Y{t) o The 
time-dependant observable Y{t) represents how the information about the initial 
property X evolves in time. A problem is that in general, Y{t) has no reason to be- 
long to the set X or to its idealization Pp. Therefore, the map X i— Y{t) cannot be 
understood as a "classical" transformation within X in any straitforward manner. 
This indicates that, in order to be able to compare observables at different times 
and identify an effective classical dynamics, some extra assumptions or inputs are 
needed. 

In principle, what we expect is that the environment is continuously monitoring 
the same observables of the system. However it is not easy to see how to model this 
phenomenon in our framework. Indeed, the time-dependant channel S{t) from the 
system to the environment only describes the information that the environment at 
time t contains about the initial state of the system. Of course we could instead 
consider discrete successive measurement of the system by different parts of the 
environment, which should in fact be close the what really happens. However, this 
would seem to remove one of the advantages of our picture, namely its ability to 
describe a continuous decoherence process. 

We will leave these questions open for further work. For now, let us introduce 
another possible approach, which consists in considering time as a variable of the 
system with no special status. 

There are two ways of interpreting the phase-space Q oi a classical system. It 
can be seen as representing the state of the system at a given time, but it can also be 
understood as representing all physically allowed histories of that classical system. 
Indeed, for simple unconstrained systems, the elements of Q are one-to-one with 
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possible time histories. If, in addition, the system described by the phase-space Q. 
contains a physical clock, then the external time becomes superfluous and can be 
considered as gauge. 

The same should be true in quantum mechanics. A quantum state ^ Ti 
can be understood as describing our general knowledge of the system, without any 
assumption about time. This is particularly apparent in the Heisenberg picture, 
in which time simply labels the possible observations. This point of view yields a 
different light on a classical limit of the form F : Bt{7i) L^(fi), which now maps 
quantum histories to classical histories. 

Let us therefore suppose that we have such a "timeless" quantum system de- 
scribed by the Hilbert space Ti. Since our decoherence framework does not make 
any explicit mention of time, we can still consider an environment B with subsys- 
tems Bi,B2, . . . and a channel 

which represents how information about Ti happens to be contained in the envi- 



ronment, and expect T to emerge as described in Section [53 

In a sense, the map F directly takes into account the dynamics. Note however 
that the classical set of histories Q may need to be larger than what we really 
expect. Remember that in our picture the limit /i — > is taken only at a later 
stage, when the constraints on the classical states L^{Q) imposed by the quantum 
uncertainty relations are ignored. In the case where the limit of whole quantum 
histories is taken into account, it is likely that the classical limit that we get for 
finite h, which is characterized by F, yields mixtures of trajectories which differ 
by powers of h, and hence explore paths which are not characterized by the ideal 
classical phase-space. 

There is another problem with this picture, which is that, for realistic interacting 
theories, the structure of the set of physical solutions is never fully understood. 
Therefore we do not know the space Ti. to start with. In fact, this is what equations 
of motions, or more generally field equations, are about: they allow one to express 
local properties of the physical system without having to understand its global 
behaviour. Equations of motion are local in time, while full field equations are also 
local in space. 

Note also that field equations do not only describe the evolution of the field in 
time, but may also describe spacial constraints that the field must satisfy. In fact 
we can view the equations of motions themselves as constraint on possible histories, 
which allows to unify the picture. 

This fact is especially vivid in the case of general relativity, where no physical 
notion of time exists prior to solving all of the equations which govern the theory. 
In this sense, if by "dynamics" we refer to the process of solving the equations 
of motion, then the dynamics of general relativity has nothing to do with time. 
Instead, one start from the set of all "kinematical" histories, defined by all possible 
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metrics (and matter fields) on a fixed spacetime manifold, and then solves Einstein's 
equations in order to obtain the "physical" histories. Einstein's equations can 
be interpreted as specifying a constraint on the kinematical histories. In fact it 
is not difficult to formulate any classical theory in a similar way, given that the 
equations of motions can always be interpreted as specifying some constraint on 
general histories of the usual phase-space variables. 

Instead of giving a classical example, let us directly show that the same can 
be done in quantum mechanics. We will follow Rovelli [UTl [U^ and show how the 
dynamics of a non-relativistic quantum particle can be formulated as a constraint, 
implemented as a projector on a "kinematical" extended Hilbert space. Consider a 
single, one-dimensional, non-relativistic particle with standard Hamiltonian 

In the position representation, the time-evolution of a particle with initial state 
ipQ = L^(]R) is given by Schrodinger's equation 

djk^ _h^dhMxl ...... 

^ dt 2m dx^ +<PWmx)- 

This equation can be reinterpreted as an equation on ijj E TCk '■= -Z^^(M^): 

in^^ + + m^ix, t) = 0. 

What this is saying is that if) must belong to the kernel of an operator C on L^(M^) 
which is defined by 

^^=^-£-*'^)- 

where E := ih^^ and p := ih^. Technically, although C has in its spectrum, 
its kernel is empty because the solutions to Schrodinger's equation cannot be nor- 
malized in L^(M^). Indeed, we know that it preserves probabilities in time, which 
means that the square of \il){x,t)\ integrated over x is the same at any fixed time 
t, which implies that the integral over t will diverge. There are various ways to 
go around this problem. Here, let us simply suppose that we are working within a 
finite interval of time. Therefore, we redefine Tix as 

Hk = L\R X [0,T]). 

This does not change anything about the properties of Schrodinger's equation, but 
it has the advantage of putting all the (time- limited) solutions inside TCk- The 
drawback is that C here is not self-adjoint, however it does have a kernel which is 
all we need from it. Let P be the projector on the kernel of C inside H-k- There is an 
isometry V : H ^ Hk such that V*V = 1-^, and VV* = P. Indeed, we know from 
the properties of Schrodinger's equation that the space of solutions is isomorphic to 
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Ti. = L^(]R). The map V* represents the "effect" of solving Schrodinger's equation: 
it projects on the space of solutions. 

Let us give an example of a possible classical limit for this dynamical system. 
To this construction, we add Fq, the coherent-state observable on Ti = 



introduced in Section 5A If Fq really defines the classical limit of the theory, we 
may hope that it indirectly maps the quantum evolution defined by the isometry 
V to an effective classical evolution. Let us write V for the channel defined by 
V, i.e. V(p) := VpV* for a state p of H. This channel V can be understood as 
mapping our knowledge of the initial state into a knowledge of the whole history 
of the particle. 

In order to obtain a proposition for the classical limit of the dynamical system, 
we first need to make some assumption about the classical limit of time itself. 
Note that we introduced time as a quantum observable which commutes with both 
the position and momentum operators. Indeed, time is a subsystem of its own 
since Hk = i^^(M) ® L^([0,T]). Let F-r be the map which sends a state p of 
the time subsystem L^([0,T]) into the classical time state fi{t) = {t\p\t), where 
\t) represents the formal eigenstates of the time operator T. Overall we obtain 
an extended classical limit Fx := Fq ® F7-, which operates between quantum and 
classical histories: 

TK:BtinK)^L\R'x [0,T]) 
It simply sends a state p of Hk to the probability distribution 

fJ^iq,P,t) = {{q,p\®{t\)p{\q,p)0\t)) (5.17) 

where \q,p) are coherent states. This map alone has nothing to do with the dynam- 
ics contained in the isometry V. However suppose that the quantum states that we 
use actually satisfy Schrodinger's equation, i.e. p = VpoV*, where po is a state of 
TC. The resulting map 

T:=TkoV 

does not directly give us a distribution over the classical phase-space, but instead 
a probability distribution over classical histories: 

F : Bt{n) L\M.^ X [0,T]). 



Note that, although this map F is technically a valid abstract classical limit, 
the resulting classical system is not deterministic in terms of the variable t G [0, T]. 
This is due to the fact that this picture assumes a process of decoherence which 
does not interfere with the exact quantum evolution of the system. It amounts to 
applying the map Fq on the state of an evolving quantum system independently 
at each time, therefore erasing the information which would be required to predict 
the evolution of an initial state. This shows that this cannot be a realistic model 
of decoherence for this particular system. 
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